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^  “  Abstract 

Wa  coasideE,the  parallel  time  c(^mplexity  of  logic  programs  without  function  symbols, 
called  logical  query  programs.  We  give  a  PRAM  algorithm  for  computing  the  minimum 
model  of  a  logical  query  program,  and  show  that  for  programs  with  the^polynomiaU 
fringe  property,”  this  ^algorithm  runs  in  logarithmic  time.  As  a  result,  the  ‘"linear” 
and  “piecewise  linear”  classes  of  logic  programs  are  in  MC.  Then  we  examine  several 
nonlinear  cases  in  which  the  program  has  a  single  recursive  rule  that  is  an^elementary 
chain.”  Among  such  "elementary  single  rule*  programs  that  are  nonlinear,  som$  can 
easily  be  shown  to  have  an  equivalent  linear  program,  hence  are  in  MC.  Ws-sta^  that 
certain  nonlinear  programs  are  related  to  GSM  mappings  of  a  balanced  parentheses 
language,  and  that  this  relationship  implies  the ‘^polynomial  fringe  property;”^  hence 
such  programs  are  in  >/C.  Finally,  we  describe  an  approach  for  demonstrating  that 
certain  logical  query  programs  are  log  space  complete  for  P,  imd  apply  it  to  both 
elementary  single  rule  programs  and  nonelementary  programs. 

1  Introduction  ^ 

We  consider  the  parallel  time  complexity  of  logic  programs  without  function  symbols, 
called  logical  query  programs.  Essentially,  such  programs  add  the  power  of  a  least  fix- 
point  operator  to  relational  algebra.  With  the  development  of  efficient  implementations 
of  Prolog  and  the  possibility  of  massive  parallel  computation  through  advances  in  VLSI 
techniques,  there  has  been  increased  interest  in  the  power  of  this  extension.  It  is  well 
known  that  the  least  fix-point  operator  is  a  genuine  extension  to  the  expressive  power  of 
the  monotonic  subset  of  relational  algebra  that  excludes  set  difference  [AU79],  and  that 
queries  expressible  in  this  extended  language  (i.e.,  queries  representable  by  function-free 
Horn  clause  query  programs)  are  computable  in  time  that  is  polynomial  in  the  size  of  the 
database  [CH82].  The  parallel  complexity  of  such  programs  was  largely  unknown  [Var85]. 

‘Supported  by  NSF  grant  IST-84-12791,  a  grant  of  IBM  Corp.,  and  ONR  contract  N00014-85-C-0731. 
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Some  of  these  logical  query  programs  involve  a  thinly  disguised  transitive  closure,  which  is 
known  to  be  in  UC.  Prominent  among  these  are  programs  employing  linear  recursion.  A 
program  is  said  to  have  linear  recursion  when  no  rule  has  more  than  one  recursive  subgoal. 

The  classification  of  programs  employing  "inherent”  nonlinear  recursion,  i.e.,  those 
that  cannot  be  imitated  by  a  program  with  linear  recursion,  has  been  an  unresolved 
question.  We  shed  some  light  on  the  nonlinear  recursion  question  in  this  paper.  We  present 
new  PRAM  algorithms  for,  and  define  the  “polynomial  fringe  property”  for  logical  query 
programs.  In  our  first  main  result,  the  Polynomial  Fringe  Theorem,  we  show  that  logical 
query  programs  with  this  property  are  in.  MC,  hence  permit  very  fast  parallel  computation, 
given  enough  resources.  Our  second  result,  the  GSM  Mapping  Theorem,  is  that  certain 
programs  with  "inherent”  nonlinear  recursion  have  the  polynomial  fringe  property;  hence 
such  progr2uns  are  in  J/C. 

The  remainder  of  the  paper  is  organized  as  follows:  in  Sections  2  and  3  we  give  basic 
definitions  and  define  the  Basic  Theorem  Problem,  by  which  we  measure  the  complexity 
of  a  logical  query  program.  In  Section  4  we  review  the  semantics  of  Horn  logic  programs, 
minimum  models,  and  derivation  trees.  In  Sections  5  and  6  we  present  new  PRAM 
algorithms  and  prove  our  first  main  result,  the  Polynomial  Fringe  Theorem.  In  Section  7 
we  mention  some  easy  consequences  of  the  Polynomial  Fringe  Theorem,  some  of  which 
were  already  known.  In  Section  8  we  prove  our  second  main  result,  the  GSM  Mapping 
Theorem.  In  Section  9  we  show  that  slight  changes  to  programs  that  are  in  MC  produce 
programs  that  are  log  space  complete  for  P.  We  conclude  in  Section  10. 

2  Basic  Definitions 

2.1  Logic  Programs 

An  atom  is  an  atomic  formula,  i.e.,  a  predicate  symbol  with  terms  as  arguments,  as 
customarily  defined  in  logic.  A  literal  is  a  polarized  atom,  i.e.  a  positive  or  negative 
atom.  A  clause  is  a  disjunction  of  literals.  A  unit  clause  has  exactly  one  literal.  A  deSnite 
clause  has  exzw:tly  one  positive  literal.  A  negative  clause  has  no  positive  literal;  the  empty 
clause  is  considered  negative.  Strictly  spealcing,  a  Horn  clause  is  either  a  definite  clause 
or  a  negative  clause;  however,  the  term  is  often  used  loosely  as  a  synonym  for  “definite 
clause”  when  no  confusion  is  likely. 

Definite  clauses  are  also  called  rules,  and  we  shall  normally  use  this  more  descriptive 
term.  The  positive  literal  is  then  called  the  head  of  the  rule,  and  the  negative  literals 
are  called  its  subgoals.  A  Horn  logic  program  is  a  set  of  rules.  An  entity  (atom,  rule, 
or  logic  program)  is  function  free  if  its  terms  are  only  variables  and  constants,  i.e.,  no 
functors  of  positive  arity  appear.  An  entity  is  ground  if  its  terms  contain  no  variables. 
For  our  purposes,  a  logical  query  program  is  a  function-free  Horn  logic  program;  we  often 
abbreviate  this  to  “logic  program” ,  or  even  “program,”  as  that  is  the  only  kind  we  consider. 


Definition  2.1:  A  basic  logic  program  is  a  finite  set  of  definite  rules  containing  two  classes 
of  predicate  symbols: 

•  IDB  (Intentional  Database)  predicates  are  those  that  appear  in  rule  heads,  and 
possibly  in  subgoals  also;  p,Pi,P3,...  denote  IDB  predicates. 

•  EDB  (Extensional  Database)  predicates  are  those  that  appear  only  in  subgoals; 

9>  92>  •  •  •  denote  EDB  predicates. 

An  EDB  fact  is  a  positive  unit  clause  over  an  EDB  predicate,  with  constants  as 
arguments.  The  constants  are  drawn  from  some  countable  set.  An  EDB  instance  (or 
EDB,  for  short)  is  a  finite  set  of  EDB  facts. 

An  extended  logic  program  is  the  union  of  a  basic  logic  program  and  an  EDB  instance; 
the  predicates  appearing  in  the  EDB  instance  are  a  subset  of  the  (syntactically  defined) 
EDB  predicates  in  the  basic  logic  program. 

Borrowing  from  the  terminology  of  context  free  languages,  we  define  a  terminal  rule  to 
be  one  in  which  all  subgoals  are  EDB  predicates,  and  a  nonterminal  rule  to  be  one  with 
at  least  one  IDB  subgoal. 

□ 

We  generally  follow  the  Prolog  convention  that  symbols  beginning  with  an  uppercase 
letter  are  logic  variables,  while  predicate  symbols  and  constants  begin  with  a  lowercase 
letter. 

We  shall  assume  for  simplicity  that  the  rules  of  the  basic  logic  program  contain  no 
constants,  i.e.,  all  predicate  arguments  in  a  rule  are  variables.  Also,  we  assume  that  every 
rule  has  at  least  one  subgoal.  (It  is  easy  to  meet  these  requirements  by  creating  a  limited 
number  of  additional  EDB  predicates  and  requiring  certain  EDB  facts  to  be  present.) 

2.2  Dependence  Structure  and  Rank  of  Rules 

The  predicates  of  a  basic  logic  program  have  a  natural  partition  into  strong  components, 
based  on  the  following  dependence  graph.  Let  each  IDB  predicate  symbol  p,-  and  eaxh  EDB 
predicate  symbol  q,  be  a  node  in  the  graph;  put  an  arc  from  py  (resp.  q^)  to  p,-  if  there  is  a 
rule  whose  head  is  p,-  and  which  has  a  py  (resp.  qj)  subgoal.  The  strong  component  of  an 
IDB  predicate  consists  of  just  the  predicates  in  its  strong  component  in  the  dependence 
graph.  Of  course,  each  EDB  predicate  is  a  singleton  strong  component. 

Definition  2.2:  A  (nonterminal)  rule  is  called  recursive  if  it  has  a  subgoal  in  the  same 
strong  component  as  the  head.  In  a  recursive  rule,  a  recursive  subgoal  is  a  subgoal  in  the 
same  strong  component  as  the  head  of  the  rule.  Other  subgoals  are  nonrecursive.  Thus  the 
same  atom  may  be  a  recursive  subgoal  in  one  rule  zmd  a  nonrecursive  subgoai  in  another. 

□ 


Associated  with  the  dependence  graph  is  the  reduced  dependence  graph.  Each  node 
in  the  reduced  dependence  graph  corresponds  to  a  strong  component  in  the  dependence 
graph.  Put  in  am  arc  from  strong  component  si  to  S]  whenever  some  predicate  in  si  is  a 
subgoal  for  a  rule  whose  head  is  in  S2.  The  reduced  graph  is  acyclic,  of  course. 

Definition  2.3:  We  define  the  rank  of  a  predicate  to  be  the  height  (length  of  longest  path 
to  a  leaf)  of  its  strong  component  in  the  reduced  dependence  graph;  the  ramk  of  a  rule  is 
the  rank  of  its  head.  The  rank  of  each  EDB  predicate  is  0.  □ 

We  observe  that  in  any  rule  the  rank  of  a  recursive  subgoal  equals  the  rank  of  the  head 
of  the  rule,  while  the  rank  of  a  nonrecursive  subgoal  is  less  than  the  rank  of  the  head. 

3  The  Basic  Theorem  Problem 

We  shall  consider  the  pau'allel  time  complexity  of  a  certain  decision  problem  involving 
logic  programs,  which  we  call  the  basic  theorem  problem  for  logic  prograuns.  For  the  basic 
theorem  problem,  a  basic  logic  program  P/  is  given  and  regarded  as  fixed.  Assume  its  IDB 
predicates  are  Pu...,p$  and  its  EDB  predicates  are  The  rules  contain  only 

variables,  but  the  input  language  has  a  countable  set  of  constants.  Recall  our  convention 
that  a  basic  logic  program  contains  no  EDB  facts;  we  use  the  subscript  J  as  a  reminder 
of  this  convention.  For  definiteness  here,  we  denote  vectors  of  constants  that  occur  as 
arguments  by  a,  6,  e,  etc.  Later,  we  shall  drop  the  overbar,  and  let  context  determine 
whether  a,  6,  etc.  represent  single  constants  or  vectors  of  constants.  The  input  to  a 
problem  instance  is  a  ground  clause  Q  of  the  form 

P.(2o)  qu  (2i).  •  •  • .  qjs  M  (1) 

and  the  question  is,  “Is  Q  a  theorem  of  P/?”  In  other  words,  the  basic  theorem  problem 
for  P/  consists  of  recognizing  the  set  of  theorems  of  P/  in  the  form  of  Eq.  1. 

An  alternative  and  equivalent  formulation  of  a  problem  instance  is  to  define  P^  to  be 
the  set  of  EDB  facts,  {9>,(ci),. . .  from  the  right  hsmd  side  of  Eq.  1,  and  define 

P  =  P/  U  Pg;  then  ask  the  question,  “Is  Pi(2o)  a  theorem  of  P?”  In  both  cases  we  define 
the  size  of  the  input  to  be  N,  the  number  of  EDB  literals,  or  faxts,  in  the  input. 

We  wish  to  characterize  basic  logic  programs  according  to  whether  their  basic  theorem 
problem  is  in  .VC,  or  is  log  space  complete  for  P  (P -complete  for  short).  That  the  problem 
is  always  in  P  can  be  seen  from  the  Naive  Evaluation  algorithm  described  below  (see 
[CH82]  for  a  complete  proof).  Loosely  speaking,  we  say  a  basic  logic  program  is  in  VC,  or 
is  P-complete,  when  we  really  mean  that  its  basic  theorem  problem  is.  This  problem  has 
been  called  the  data  complexity  problem  in  [CH82,Var82]. 

Our  computational  model  is  a  PRAM  in  which  concurrent  writes  must  be  consistent 
(the  common  write  model).  We  are  not  concerned  with  representation  issues,  so  make  the 


convenient  assumption  that  constants  in  (i.e.,  the  Herbrand  universe  of  the  extended 
logic  program)  consist  of  a  reasonably  dense  subset  of  some  prefix  of  the  natural  numbers, 
so  that  there  is  no  problem  in  associating  processors  and  global  memory  locations  with  a 
vector  of  EDB  constants.  The  treatment  of  P/  as  fixed  is  important  in  our  accounting. 
In  particular,  there  are  integers  a  and  b  that  represent  the  maximum  arities  of  any  IDB 
predicate  and  any  EDB  predicate,  respectively,  but  depend  only  on  P/  emd  not  on  the 
EDB,  Pfi,  of  a  particular  problem  instance.  We  are  assured  of  having  at  most  Nb  different 
constants  in  a  Pb  of  size  N.  Let  v  possibly  depend  on  P/,  but  not  on  Pg.  The  number 
of  v-tuples  of  EDB  constants  is  then  at  most  (iVfc)",  i.e.,  is  polynomial  in  the  size  of  the 
input,  and  we  can  consider  assigning  a  separate  processor  or  memory  location  to  each  such 
tuple.  We  assume  that  once  the  EDB  is  given,  any  processor  can  check  whether  a  ground 
atom  is  an  EDB  fact  and  can  associate  a  memory  address  with  a  v-tuple  of  EDB  constants 
in  constant  time. 

4  Semantics  of  Logic  Programs  and  Derivation  Trees 

Following  [VEK76,AVE82],  we  define  the  Herbrand  base  t/  of  an  extended  logic  program 
P  to  be  the  set  of  variable-free  atoms  containing  no  predicates  or  constants  other  than 
those  occurring  in  P.  An  interpretation  is  a  subset  of  U. 

With  a  logic  program  P  we  associate  a  function  Tp  from  interpretations  to  interpreta¬ 
tions.  Let  /  be  an  interpretation.  We  define  a  ground  atom  a  to  be  in  Tp{I)  if  amd  only  if 
there  exists  in  P  a  rule  6o  6i , . . . ,  and  there  exists  a  substitution  $  such  that: 

•  a  is  syntactically  identical  to  bod,  the  instantiated  head  of  the  rule,  and 

•  I  contains  every  instantiated  subgoal,  6i^, . . . ,  6„^. 

We  define  a  lattice  whose  set  is  the  powerset  of  the  Herbrand  base,  and  whose  partial 
order  is  set  inclusion.  If  P  is  a  definite  logic  program,  then  Tp  is  monotone  with  respect 
to  this  order.  By  the  Knaster-Tarski  Fixpoint  Theorem  every  monotone  function  has  a 
least  fixpoint.  It  is  well  known  [VEK76)  that  the  least  fixpoint  of  Tp  is  the  minimum 
Herbrand  model  of  P  (i.e.,  is  a  subset  of  every  Herbrand  model  of  P).  Moreover,  in  the 
function-free  c^e  there  is  an  integer  K  such  that  Tp  (0)  equals  the  minimum  model.  Let 
us  call  an  interpretation  a  partial  model  if  it  is  a  subset  of  the  minimum  model.  Then 
Tp(ill)  is  clearly  a  partial  model  for  all  natural  numbers,  k. 

The  EDB  portion  of  the  minimum  model  is  simply  the  set  of  EDB  facts  (since  by 
definition  EDB  predicates  never  appear  in  the  head  of  a  rule),  so  we  take  for  granted  that 
EDB  facts  are  included  in  all  partial  models  we  compute.  Consequently,  when  we  talk 
about  representing  or  computing  “the  model,”  we  mean  the  IDB  portion  of  the  minimum 
Herbrand  model. 


Definition  4.1:  Given  an  extended  logic  program  P,  a  derivation  tree  for  a  ground  atom 
is  a  rooted  tree  with  atoms  as  nodes  and  edges  between  parents  and  children,  such  that 

•  p°  is  the  root 

•  for  every  internal  node  r°  whose  children  are  there  is  some  ground  rule 

instance 

r° 

,  where  r*  are  atoms  with  various  predicate  symbols. 

•  every  node  is  in  the  minimtim  model  of  P;  leaves  are  not  necessarily  in  the  EDB. 

A  nonterminal  derivation  tree  Is  one  in  which  each  internal  node  has  at  least  one 
IDB  atom  among  its  children;  i.e.,  only  nonterminal  rules  were  used  in  the  derivation.  A 
complete  derivation  tree  is  one  in  which  all  leaves  are  EDB  facts. 

By  path  in  a  derivation  tree  we  mean  a  directed  path  away  from  the  root.  The  fringe 
of  a  tree  is  the  set  of  its  leaves.  A  tight  derivation  tree  is  one  in  which  no  node  has  an 
ancestor  identical  to  itself;  i.e.,  no  atom  occurs  twice  on  one  path.  □ 

5  PRAM  Algorithms  for  Logical  Query  Programs 

We  begin  by  describing  fairly  general  parallel  algorithms  for  computing  the  minimum 
model  of  an  extended  logic  program.  Analysis  of  these  algorithms  leads  to  a  sufficient 
condition  for  a  basic  logic  program  to  be  in  MC.  (Cleeirly  the  basic  theorem  problem  of 
P/  is  in  MC  if  and  only  if  computation  of  the  minimum  model  of  P/  U  P^  is  in  MC:  Since 
the  Herbrand  base  is  of  polynomial  size,  we  can  consider  ail  czmdidates,  Pi(£),  in  parallel.) 

Let  N  be  the  length  of  an  input  to  the  basic  theorem  problem.  Recall  that  the 
input  consists  of  one  IDB  ground  atom  (the  "top-level  goal”)  and  N  EDB  ground  atoms 
(EDB  facts),  and  that  the  problem  is  to  decide  whether  the  EDB  facts  together  with  the 
nonterminal  rules  imply  the  top-level  goal.  The  number  of  constants  in  the  input  is  0{N). 

This  problem  is  well-known  to  be  in  P  in  view  of  the  algorithm  Naive  Evaluation,  which 
simply  computes  Tp{9),. . .  ,Tp{9)  until  a  fixpoint  is  reached  |AU79,CH82].  If  parts  (b) 
and  (c)  are  removed  from  our  Basic  Evaluation  algorithm  below,  the  result  is  a  PRAM 
implementation  of  Naive  Evaluation.  Each  iteration  can  be  done  in  constant  time,  but 
it  is  easy  to  find  MC  examples,  such  as  “transitive  closure,”  for  which  K  =  0(N)  for 
Naive  Evaluation.  Parts  (b)  and  (c)  provide  the  speed-up  needed  to  bring  a  large  class  of 
programs  down  to  poly-log  PRAM  time. 

Algorithm  5.1:  (Basic  Evaluation)  The  constants  appearing  in  the  EDB  comprise  the 
Herbrand  universe.  "Instantiations”  refer  to  instantiations  of  variables  to  these  constants. 
We  shall  build  the  minimum  Herbrand  model  of  the  extended  logic  program  by  constructing 
a  directed  graph  in  which  the  nodes  are  the  IDB  atoms  in  the  Herbrand  base,  plus  a  node 
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for  true.  We  call  this  graph  the  implication  graph  of  the  extended  logic  program.  We 
regard  an  atom  as  being  in  the  partial  model  (and  call  that  atom  true)  whenever  there  is 
an  arc  from  true  to  that  atom  in  the  implication  graph.  Initially  the  implication  graph 
has  no  edges;  the  partial  model  is  empty. 

Initialization:  For  each  terminal  rule,  evaluate  all  its  instantiations  in  parallel;  for 
those  proved  by  the  EDB  facts,  put  in  an  arc  from  true  to  the  node  corresponding  to 
the  instauitiated  head  of  the  rule  thereby  adding  it  to  the  partial  model.  For  each  other 
rule,  consider  all  its  instantiations  in  parallel;  those  such  that  all  EDB  subgoals  are  EDB 
facts  are  “live,”  aind  are  simplified  by  removing  the  EDB  subgoals.  Rule  instances  with  am 
EDB  subgoal  for  which  there  is  not  a  matching  EDB  fact  can  be  discarded  from  further 
consideration.  If  v  is  the  maximum  number  of  variables  in  any  nonterminal  rule,  then 
O(W')  processors  can  accomplish  the  foregoing  in  a  constant  number  of  steps. 

Iteration:  Perform  a  new  iteration  stage  as  long  as  any  new  IDB  atom  was  derived 
(i.e.,  a  new  arc  from  true  to  some  node  was  added)  in  the  previous  stage. 

Part  (a)  Begin  by  checking  each  live  rule  instance  in  parallel.  Remove  any  of  its 
remaining  (IDB)  subgoals  that  are  now  “true,”  i.e.,  in  the  partial  model.  If  it  has  no 
subgoals  left,  put  an  arc  from  true  to  the  head  of  the  rule  instance  into  the  implication 
graph. 

Part  (b)  If  a  live  rule  instance  has  one  subgoal  left,  suppose  it  has  been  reduced  to 
p*  p*.  Put  the  arc  p*  — ►  p*  into  the  implication  graph;  i.e.,  put  in  an  arc  from  the  node 
corresponding  to  the  instantiated  subgoal  to  the  node  corresponding  to  the  instantiated 
head  of  the  rule. 

Part  (c)  Now  transitively  close  the  current  implication  graph.  If  o  is  the  maximum 
number  of  arguments  in  any  IDB  predicate,  the  implication  graph  has  0(W“)  nodes,  so 
the  transitive  closure  can  be  done  in  poly-log  time  with  “only”  0{N^“)  processors.  Note 
that  any  nodes  that  become  reachable  from  true  are  now  in  the  partial  model.  This 
completes  one  stage  of  the  iteration. 

Termination:  When  nothing  has  been  added  to  the  partial  model  in  an  iteration  stage, 
halt.  Lemma  5.1  shows  that  the  minimum  Herbrand  model  has  been  constructed.  For  the 
basic  theorem  decision  problem,  the  top-level  goal  atom  is  a  theorem  precisely  if  it  is  in 
this  model.  □ 

Lemma  5.1:  Algorithm  5.1  correctly  computes  the  minimum  Herbrand  model  of  the 
extended  logic  program. 


Proof  :  The  proof  follows  the  proofs  in  [VEK76,AU79,CH82]  that  7p(0)  converges  to  the 
least  fixpoint,  finitely  in  the  function-free  case.  The  only  difference  is  that  arcs  in  the 
implication  graph  are  used  also.  But  arc  a  — *  6  is  easily  seen  to  correspond  to  a  theorem 
(or  lemma,  if  you  will),  “a  implies  b,”  and  transitive  closure  (part  (c))  amounts  to  reasoning 
by  modus  ponens.  | 


The  PRAM  time  for  an  iteration  stage  of  Algorithm  5.1  is  easily  seen  to  be  constant  for 
parts  (a)  and  (b),  and  poly- log  for  part  (c).  In  order  to  show  that  a  basic  logic  program 
is  in  MC,  it  is  sufficient  to  show  that  Algorithm  5.1  requires  only  poly-log  stages  when 
applied  to  this  program.  Theorem  6.2  provides  a  general  tool  for  this  purpose.  But  first, 
we  present  a  modification  of  Basic  Evaluation. 

Algorithm  5.2:  (Fast  Evaluation)  All  parts  of  this  algorithm  are  the  same  as  Algo¬ 
rithm  5.1  (Basic  Evaluation)  except  part  (c)  of  the  iteration  stage,  which  is  replaced  by: 

Part  (c)  Now  do  one  “transitive  closure  step”  in  the  current  implication  graph.  Specif¬ 
ically,  in  parallel,  if  p*  — ►  p*  and  p*  -+  p*  are  already  in  the  implication  graph,  then  put  in 
arc  p’  — *  p*.  Again,  if  a  is  the  maximum  number  of  arguments  in  any  IDB  predicate,  the 
implication  graph  has  0(7^“)  nodes,  so  this  step  cam  be  done  in  constant  time  with  “only” 
0(7V*“)  processors.  Note  that  any  nodes  that  become  reachable  from  true  axe  now  in  the 
partial  model.  □ 

A  little  thought  reveals  that  Fast  Evaluation  computes  the  same  implication  graph  as 
Basic  Evaluation,  due  to  the  monotonicity  of  both  algorithms.  Assuming  Basic  Evaluation 
does  “transitive  closure”  by  repeated  application  of  part  (c)  of  Fast  Evaluation,  it  is  further 
evident  that  Fast  Evaluation  is  at  least  as  fast  as  Basic  Evaluation.  In  fact,  in  the  cases  we 
can  analyze,  both  algorithms  require  0(log  N)  stages,  but  Basic  Evaluation  may  require 
0(log  TV)  per  stage,  while  Fast  Evaluation  requires  only  constant  time  per  stage.  Thus  the 
name  “Fast  Evaluation”  is  justified. 

6  Polynomial  Fringes  and  )^C 

In  this  section  we  define  a  key  property  for  basic  logic  programs,  the  polynomial  fringe 
property,  and  show  that  programs  with  this  property  run  in  poly-log  time  under  the  Basic 
and  Fast  Evaluation  algorithms  presented  in  Section  5.  Ruzzo  heis  obtained  interesting 
related  results  for  alternating  Turing  machines  with  polynomial  sized  computation  trees 
[RuzSO];  however,  his  tree  transformations  appear  to  be  based  on  a  different  principle 
from  ours,  involving  a  centroid  concept.  Our  methods  are  more  analogous  to  a  method 
of  parallel  evaluation  of  tree  expressions  used,  e.g.,  in  [VSBR83],  [MR85],  and  [AH85]. 
What  is  somewhat  novel  in  our  use  of  tree  transformations  is  that  our  algorithm  does  not 
explicitly  process  (or  preprocess)  the  tree;  we  only  use  the  tree  as  a  device  to  analyze  the 
algorithm. 

Definition  0.1:  A  basic  logic  program  has  the  polynomial  fringe  property  (relative  to  a 
class  of  EDB’s  D)  if  for  every  EDB  (in  class  0)  and  every  atom  in  the  minimum  model  of 
the  resulting  extended  logic  program,  that  atom  has  a  derivation  tree  whose  fringe  is  of 
polynomial  length  in  the  size  of  the  EDB.  If  the  class  P  is  not  mentioned,  it  is  taken  to  be 
all  possible  EDB’s.  □ 
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Next  we  quote  a  useful  lemma,  which  allows  us  to  restrict  attention  to  tight  derivation 
trees,  which  have  polynomial  depth.  Recall  that  a  derivation  tree  is  tight  if  no  atom  occurs 
twice  on  one  path  from  the  root.  Thus  in  tight  derivation  trees,  a  polynomial  number  of 
nodes  is  equivalent  to  a  polynomial  fringe  length. 

Lemma  6.1:  Every  atom  in  the  minimum  model  of  a  logical  query  program  has  a  tight 
derivation  tree,  whose  depth  is  polynomial  in  the  size  of  the  EDB. 

Proof :  If  a  complete  derivation  tree  for  atom  a  is  not  tight,  then  there  is  another  complete 
derivation  tree  for  a  with  fewer  nodes  and  no  more  leaves,  obtained  as  follows:  Find  the 
node  b'  with  identical  ancestor  6,  and  substitute  the  subtree  rooted  at  V  in  place  of  the  b 
subtree.  It  follows  that  the  minimum  complete  derivation  tree  for  a  is  tight.  Since  there 
are  only  polynomially  many  instances  of  IDB  literals  in  the  Herbrand  base,  and  no  instance 
appears  twice  on  one  path  in  a  tight  derivation,  the  lemma  follows.  | 

The  following  theorem  is  a  weak  form  of  the  Polynomial  Fringe  Theorem  (Theorem  6.3), 
but  serves  as  an  introduction  to  many  of  the  ideas  in  a  simpler  form. 

Theorem  0.2:  A  basic  logic  program  with  the  polynomial  fringe  property  is  in  MC. 

Proof :  We  show  that  for  any  IDB  atom  p°  in  the  minimum  model  of  P,  if  p°  has  a  complete 
derivation  tree  with  fringe  F,  then  Algorithm  5.1  requires  at  most  log,  |F)  +  1  stages  to 
put  p°  into  the  model  it  constructs,  where  1F|  is  the  number  of  atoms  in  F. 

In  order  to  track  the  progress  of  the  computation  as  it  relates  to  p°,  we  define  the 
remaining  derivation  tree  for  p°  to  be  the  original  (complete)  derivation  tree  with  all  nodes 
that  so  far  have  been  put  into  the  partial  model  removed.  Accordingly,  at  the  beginning 
of  the  first  iteration  stage,  all  EDB  facts  have  been  pruned  from  the  remaining  derivation 
tree,  and  it  contains  only  IDB  atoms.  At  the  beginning  of  any  iteration  stage,  the  fringe 
consists  of  atoms  that  will  be  put  into  the  partial  model  during  this  stage,  as  all  the 
required  subgoal  instances  have  been  put  in  during  preceding  stages.  Thus  every  atom 
in  the  fringe  at  the  beginning  of  the  stage  is  pruned  from  the  remaining  derivation  tree 
during  part  (a)  of  that  stage. 

The  key  observation  is  that  amy  atom  that  is  in  the  fringe  at  the  end  of  an  iteration 
stage  had  at  least  two  descendant  atoms  that  were  in  the  fringe  at  the  beginning  of  that 
stage.  For  suppose  some  atom  p^  has  only  one  descendant  p^  in  the  fringe  at  the  beginning 
of  the  stage.  Then  it  must  have  only  one  remaining  subgoal  (i.e.,  not  yet  in  the  partial 
model).  Moreover,  every  descendant  of  p‘  other  than  p*  must  have  had  only  one  remaining 
subgoai.  Then  arcs  existed  in  the  implication  graph  that  form  a  path  from  p^  to  p^,  at 
the  beginning  of  the  stage.  Since  p*  was  in  the  fringe,  the  arc  from  true  to  p*  was  put  in 
during  part  (a),  completing  a  path  from  true  to  p*  in  the  implication  graph.  Therefore, 
during  part  (c)  an  arc  from  true  to  p*  was  put  in  the  implication  graph,  making  p^  part  of 
the  partial  model.  Thus  p'  with  only  one  fringe  descendant  cannot  be  in  the  new  fringe. 


Since  every  new  fringe  atom  had  two  fringe  descendants  in  the  remaining  derivation 
tree  of  the  preceding  stage,  the  size  of  the  fringe  halves  during  each  iteration  stage.  Thus, 
the  root  is  put  into  the  partial  model  within  log,  |F|  +  1  stages.  Since  each  stage  requires 
poly-log  time  and  uses  a  polynomial  number  of  processors,  the  theorem  follows.  | 

Example  6.1:  Suppose  atom  has  the  remaining  derivation  tree  shown  in  Fig.  1  after 
the  initialization  stage  of  the  Basic  Evaluation  algorithm,  and  assume  it  has  no  other 
derivations.  It  is  evident  that  Naive  Evaluation  requires  four  steps  to  derive  p^.  Let  us 
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Figure  1;  An  unbalanced  remaining  derivation  tree 
track  the  progress  of  Basic  Evaluation. 

(0)  Implication  graph  empty.  Remaining  derivation  tree  is  as  shown  in  Fig.  1,  with  four 
fringe  nodes. 

(la)  Implication  graph  gets  arcs  from  true  to  p^,  p®,  p®,  and  p®. 

(lb)  Implication  graph  gets  arc  p®  —*  p*. 

(lc)  Transitive  closure  puts  in  arc  from  true  to  p*.  Remaining  derivation  tree  is  reduced 
to  p',  p’,  and  p*. 

(2a)  Implication  graph  gets  arc  from  true  to  p*. 

(2b)  Implication  graph  gets  arcs  p*  -*  p*  and  p’  — ►  p^. 


(2c)  Transitive  closure  puts  in  arcs  from  true  to  p*  auid  p  ,  as  well  bs  p*  p^.  Remaining 
derivation  tree  is  now  empty. 

Thus  Basic  Evaluation  derives  p^  in  two  stages.  Observe  that  stage  (2c)  requires  two 
“transitive  closure  steps.”  (Realistically,  a  third  step  is  also  needed  to  see  that  nothing 
new  is  found.)  Because  Fast  Evaluation  does  not  do  a  full  transitive  closure  in  each  stage, 
it  requires  three  stages.  On  problems  of  any  nontrivial  size,  the  reduced  time  per  stage 
more  than  maJces  up  for  the  additional  stages.  □ 

Programs  with  the  polynomial  fringe  property  run  in  log*  N  PRAM  time  under  Ba¬ 
sic  Evaluation.  It  is  straightforward  to  simulate  this  computation  with  a  log*  N  depth 
Booleein  circuit  of  bounded  fan-in  gates.  (It  is  easy  to  see  that  log*  N  PRAM  time  in  our 
algorithms  translates  to  log*'*'^  N  depth  for  a  bounded  fan-in  circuit,  and  to  log*  N  depth 
for  polynomial  fan-in.)  The  class  >/C*  essentially  consists  of  problems  that  can  be  solved 
by  “uniform”  circuits  of  depth  0(log*I\r)  using  a  polynomial  number  of  bounded  fan-in 
gates;  we  omit  the  technical  uniformity  condition  [Coo84].  Since  our  class  of  problems 
includes  “transitive  closure,”  log*  N  depth  is  the  best  known  bound  amd  is  likely  optimal, 
so  it  is  of  some  interest  to  show  that  Fast  Evaluation  achieves  this  bound  on  programs 
with  the  polynomial  fringe  property.  Our  proof  uses  the  node-counting  method  of  [MR85]. 

Theorem  6.3;  (Polynomial  Fringe  Theorem)  A  basic  logic  program  with  the  polynomial 
fringe  property  is  in 


Proof :  We  show  that  for  any  IDB  atom  p°  in  the  minimum  model  of  P,  if  p°  has  a 
nonterminal  complete  tight  derivation  tree  T  with  jT]  nodes,  then  Algorithm  5.2  (Fast 
Evaluation)  requires  at  most  log*  |r|  stages  to  put  p°  into  the  model  it  constructs.  Since 
the  polynomial  fringe  property,  together  with  Lemma  6.1,  guarantee  that  |r|  is  polynomial 
in  N,  the  theorem  follows. 

In  order  to  track  the  progress  of  the  computation  as  it  relates  to  p°,  we  define  an  effective 
derivation  tree  for  p°.  As  in  Theorem  6.2,  this  tree  is  initially  a  complete  derivation  tree; 
however  the  updating  operation  is  more  complicated.  As  before,  all  nodes  that  are  put 
into  the  pzirtial  model  (as  well  as  nodes  corresponding  to  EDB  facts)  are  removed.  In 
addition,  all  “chains”  are  shortened  in  accordance  with  arcs  in  the  current  implication 
graph.  Specifically,  starting  with  the  root  of  the  current  effective  derivation  tree  and 
working  down,  if  there  is  a  sequence  of  nodes  p*,p*, . . .  ,p'’  such  that: 

•  p*  hzis  only  one  child,  p*"^*  for  1  <  *  <  r; 

•  p'  has  no  children  or  several  children; 

•  All  arcs  p’'*'*  -♦  p’  for  1  <  i  <  r  are  in  the  current  implication  graph; 


then  we  call  a  maximal  chain.  Given  a  maximal  chain  let  i  be  the 

largest  index  such  that  p*  —*  is  in  the  current  implication  graph.  Then  we  update  the 
effective  derivation  tree  by  making  p*  the  (only)  child  of  p^,  and  removing  p*, . . .  ,p*“^  from 
the  effective  derivation  tree.  This  type  of  updating  continues  until  it  is  no  longer  possible, 
always  operating  on  the  maximal  chain  closest  to  the  root  when  there  is  a  choice. 

Thus  edges  (e.g.,  between  p^  and  p*  in  the  above  example)  can  appear  in  the  effective 
derivation  tree  that  were  not  in  the  original  derivation  tree. 

Let  Tif  be  the  effective  derivation  tree  after  stage  k,  and  let  Fk  be  its  fringe.  Accordingly, 
in  To,  all  EDB  facts  have  been  pruned  from  7*,  and  it  coincides  with  the  remaining 
derivation  tree  at  this  point.  However,  as  the  iteration  proceeds,  effective  derivation  trees 
diverge  from  remaining  derivation  trees,  in  general. 

At  the  beginning  of  any  iteration  stage,  the  fringe  consists  of  atoms  that  will  be  put 
into  the  partial  model  during  this  stage,  as  all  the  required  subgoal  instances  have  been 
put  in  during  preceding  stages.  Thus  every  atom  in  the  fringe  at  the  beginning  of  the 
stage  is  pruned  from  the  effective  derivation  tree  during  part  (a)  of  that  stage.  No  change 
occurs  during  part  (b). 

In  order  to  describe  the  changes  in  the  effective  derivation  tree  that  occur  during 
part  (c)  of  stage  k  of  Fast  Evaluation,  we  define  a  homeomorphism  h  on  T*,  the  effective 
derivation  tree  at  the  beginning  of  stage  k  (i.e.,  before  part  (a)).  Let  Uk  =  h.{Tk)  be  formed 
by  mapping  every  node  of  T*  that  has  exactly  one  child  into  its  nearest  descendant  that 
either  is  a  leaf  or  has  at  least  two  children.  Other  nodes  of  Tk  map  to  themselves,  and 
provide  the  “names”  of  the  nodes  in  17*.  In  particular,  F*  maps  to  itself.  For  u  €  l^*,  or 
equivalently,  for  u  €  T*  such  that  h{u)  —  u,  define  h~^(u)  to  be  the  set  of  nodes  that  map 
into  «,  and  define  ehain{u)  =  h~^{u)  —  {u}.  Every  internal  node  of  (7*  has  at  least  two 
children.  It  follows  (c/.  complete  binary  trees)  that  (I7*(  <  2|F*(. 

For  any  u  6  17*,  order  ekain{u)  nodes  by  distance  from  the  root,  and  call  them  p^, . . . ,  p**. 
During  part  (c)  of  stage  fc  +  1  of  the  Fast  Evaluation  algorithm,  arcs  p’  — *•  p^,  p*  — ►  p*, 
p®  — ►  p*,  etc.,  will  be  put  into  the  implication  graph.  The  effective  derivation  tree  will  be 
updated  as  follows:  p®  will  become  the  child  of  p*  and  p®  will  be  removed;  p®  will  become 
the  child  of  p®  and  p*  will  be  removed;  etc.  ^  In  summary,  if  chain(u)  contains  r  nodes  in 
Tk,  it  will  shrink  to  [jr]  nodes  in  Tk+i-  We  have  the  following  relationships: 

ini  =  E  =  it'.l  +  E 

ueUt  ueUt 

in+,i  =  |ir*i-|F,i+ E 

ueuu 

It'il  <  2|nl 

‘Arcs  p*  — »  p^  and  others  remain  in  the  implication  graph,  of  course,  but  play  no  further  role  in  this 
effective  derivation  tree. 


If  we  regard  \Tk\  as  fixed  and  regard  \Uk\,  |Fk|,  and  |eAam(u)|  as  variables,  and  look  for 
the  maximum  value  of  |7fc+i|,  it  can  be  shown  (with  Lagrange  multipliers,  e.g.)  that  the 
maximum  occurs  when  |cAam(«)|  =  1  for  all  u  €  Uk.  With  this  substitution,  \Uk\  =  ||7*| 
and 

Thus  |7*|  will  be  0  after  at  most  log*  |r|  stages. 

To  complete  the  proof,  it  is  only  necessary  to  note  that  whenever  a  node  is  removed  from 
the  effective  derivation  tree,  either  it  is  in  the  partial  model  (i.e.,  it  was  in  the  fringe),  or  it 
has  a  pzurent.  Thus  the  root  p°  must  be  in  the  partial  model  when  the  effective  derivation 
tree  has  been  reduced  to  nothing.  | 

7  Preliminary  Results  on  Programs  in  A/C 

In  this  section  we  mention  several  known  results  that  are  easy  corollaries  of  the  Polynomial 
Fringe  Theorem. 

7.1  Context  Free  Language  Recognition 

Deciding  whether  an  input  string  is  in  a  (fixed)  context  free  language,  i.e.,  the  nonuniform 
word  problem,  has  been  shown  to  be  in  MC  in  (RuzSO]  through  a  series  of  simulations. 
In  [VSBR83]  it  is  transformed  into  a  polynomial  evaluation  problem.  Here  we  obtain  the 
result  as  a  corollary,  with  a  straightforward  algorithm  as  well. 

Corollary  7.1:  The  word  problem  for  a  fixed  context  free  language  is  in  1/C’. 

Proof:  Let  C  be  a  grammar  for  the  language,  in  Chomsky  normal  form  (other  epsilon- 
free  forms  will  also  work).  The  nonterminal  symbols  in  the  grammar  (denoted  somewhat 
unconventionally  by  lowercase  letters)  will  correspond  to  IDB  predicates,  and  the  terminal 
symbols  (denoted  by  Greek  letters)  will  correspond  to  EDB  predicates.  For  each  production 
a  — ♦  6c  we  create  a  nonterminal  rule 

a{I,K)  b(I,J),c(J,K). 

and  of  course  a  —*  a  becomes  the  terminal  rule 

a(/,J)  a{I,J). 

Now  our  Fast  Evaluation  algorithm  based  on  this  IDB  will  not  necessarily  run  in  poly¬ 
log  time  on  an  wbritrary  EDB.  However,  to  recognize  strings  in  the  language,  we  only 
need  to  process  certain  EDB’s.  In  particular,  if  the  input  string  is  ai  •  •  •  a„,  we  put  the 
facts  ai(0, 1),  02(1,2),  . . . ,  a„{n  -  l,n)  into  the  EDB.  We  take  all  such  EDB’s  to  be  our 


class  5.  It  is  evident  that  an  IDB  atom  a(i,j)  is  derivable  only  if  it  has  a  derivation  tree 
with  fringe  equal  to  the  substring  of  symbols  t  + 1  through  j  of  the  input  string.  (Note  that 
this  fact  depends  on  having  no  epsilon  productions.)  This  fringe  is  of  length  j  -  i.  Hence 
the  program  has  the  polynomial  fringe  property  relative  to  5  and,  restricted  to  EDB’s  in 
5,  is  in  MC.  I 

7.2  Linear  and  Piecewise  Linear  Programs 

That  linear  logic  programs  are  in  >/C  has  independently  been  discovered  by  M.  Vardi  and 
D.  Maier,  P.  Kanellakis,  and  possibly  others.  (We  are  not  aware  of  any  published  version 
of  this  “folk  theorem.”)  Piecewise  linear  programs  are  a  natural  generalization.  Perhaps 
they  comprise  the  largest  previously  known  class  of  JJC  logic  programs. 

Definition  7.1:  A  piecewise  linear  logic  program  is  one  in  which  each  rule  has  at  most 
one  recursive  subgoal.  If  in  addition  each  rule  contains  at  most  one  IDB  subgoal,  the 
program  is  said  to  be  linear.  If  a  rule  contains  two  or  more  recursive  subgoals,  then  that 
rule  and  the  logic  program  are  said  to  be  nonlinear.  Recall  that  any  nonrecursive  IDB 
subgoal  is  necessarily  of  lower  rank  than  the  head  of  the  rule.  □ 

Corollary  7.2:  Any  piecewise  linear  logical  query  program  is  in  >/C. 

Proof:  We  use  induction  on  k,  the  rank  of  derivable  atoms,  and  show  that  atoms  of  rank 
k  have  a  polynomial  bound  Qk  on  the  fringe  length  of  their  minimum  complete  derivation 
trees. 

The  basis,  fc  =  0,  is  trivial  as  these  axe  EDB  predicates,  and  we  may  choose  Qo  =  !• 
Now  for  k  >  0  assume  that  derivable  atoms  of  rank  less  than  k  have  complete  derivation 
trees  whose  fringe  lengths  are  bounded  by  polynomial  Qk-i-  Consider  the  rules  of  (any) 
one  strong  component  of  rank  k.  Any  derivable  atom  in  that  component  is  derivable 
with  these  rules,  together  with  rules  of  lower  rank.  For  these  predicates  (of  rank  k),  each 
internal  node  of  a  complete  derivation  tree  has  at  most  one  recursive  child  and  a  number 
of  nonrecursive  children  bounded  by  5,  the  maximum  number  of  subgoals  in  any  rule  of 
the  IDB.  All  the  nonrecursive  children  have  rank  less  than  k,  so  have  fringes  bounded  by 
Qk-i.  Thus  the  total  fringe  length  is  within  a  factor  of  sQk-i  of  the  depth.  By  Lemma  6.1, 
every  derivable  atom  has  a  polynomial  depth  derivation  tree.  Thus  there  is  a  polynomial 
bound  Qk,  on  the  fringe  length  of  minimum  complete  derivation  trees  for  atoms  of  rank  k. 
Since  the  maocimum  rank  is  bounded,  the  program  has  the  polynomial  fringe  property.  | 

7.3  Elementary  Chain  Rules 

When  we  consider  the  class  of  nonlinear  basic  logic  programs,  even  those  with  a  single 
recursive  rule,  and  therefore  a  single  recursive  predicate  p,  the  water  turns  muddy  very 


quickly.  However,  if  we  further  restrict  ourselves  to  elementaxy  chain  rules  (defined  below), 
we  can  state  some  results.  In  this  section  we  define  terms  and  state  results  for  certain 
nonlinear  programs  that  have  linear  equivalents.  In  subsequent  sections  we  examine  the 
more  difficult  question  of  nonlinear  programs  with  no  linear  equivalents. 

Definition  7.2:  We  say  that  a  logic  program  is  in  join  normal  form  if: 

•  Every  variable  that  appears  in  the  head  of  a  recursive  rule  also  appears  in  some 
subgoal  of  that  rule. 

•  Every  variable  in  a  recursive  rule  appears  in  some  recursive  atom  (either  the  head 
or  a  recursive  subgoal) . 

□ 

Definition  7.3:  Two  basic  logic  programs  are  said  to  be  equivalent  with  respect  to  a  set 
of  IDB  predicates  S  if  the  minimum  models  of  both  prograims  extended  with  the  same 
EDB,  restricted  to  predicates  in  S,  are  the  same.  □ 

Any  basic  logic  program  can  be  rewritten  into  one  in  join  normal  form  that  is  equivalent 
with  respect  to  the  original  IDB  predicates.  If  it  started  with  one  recursive  rule  it  will 
finish  with  one  recursive  rule.  In  general,  new  predicates  and  nonrecursive  rules  for  them 
will  be  introduced. 

Example  7.1:  Suppose  variables  X  and  Y  appear  only  in  nonrecursive  subgoals  qi,  q^ 
and  p]  of  a  recursive  rule  in  a  basic  logic  program,  which  we  suppose  to  be: 

PxiU,W,Z)  qx{X,UU2{X,Y),p^{U,V,Y),pi[y,W,Z). 

and  suppose  pt,...,P4  are  the  IDB  predicates  already  defined.  We  can  define  a  new 
nonrecursive  IDB  predicate  ps  with  the  rule: 

P6(C/,V)  :-  qi{X,U),q2{X,Y),p2{U,V,Y). 

Observe  that  the  offending  variables  X  and  Y  are  “projected  out”  in  relational  algebra 
terminology.  The  variables  U  and  V  appearing  in  ps  are  just  those  that  appeared  in  qi,  92 
or  P2  and  in  a  pi  atom  as  well.  Now  we  can  replace  the  rule  given  for  pi  by: 

Pi([/,IV,Z)  :-Ps(U,V),pUV,W,Z). 

□ 

Definition  7.4:  An  elementary  chain  is  (1)  an  ordered  list  of  subgoals  (atoms)  all  of  whose 
arguments  are  variables  and  (2)  an  ordered  partition  of  each  predicate’s  arguments,  for 
each  predicate  appearing  in  the  list  (possibly  in  several  subgoals).  Each  partition  is  into 
left  and  right  blocks,  such  that: 


•  Certain  variables  appear  only  in  the  left  block  of  the  first  subgoal,  and  are  called  the 
left  block  of  variables  of  the  chain. 

•  Certain  variables  appear  only  in  the  right  block  of  the  last  subgoal,  and  aire  called 
the  right  block  of  variables  of  the  chain. 

•  All  variables  that  do  not  appear  in  the  left  or  right  blocks  appear  precisely  in  two 
adjacent  subgoals.  The  variables  in  the  right  block  of  a  subgoal  (other  than  the  last 
subgoal)  appear  in  the  same  order  in  the  left  block  of  the  following  subgoal. 

An  elementary  chain  rule  is  a  rule  whose  head  contains  predicate  p,  in  which  the  arguments 
of  p  can  be  partitioned  into  two  blocks,  called  the  left  block  auid  right  block,  and  the 
subgoals  caji  be  ordered  to  form  an  elementary  chain,  and  moreover: 

•  The  variables  in  the  left  block  of  the  head  of  the  rule  appear  in  the  same  order  in 
the  left  block  of  the  chain  of  subgoals. 

•  The  variables  in  the  right  block  of  the  head  of  the  rule  appear  in  the  same  order  in 
the  right  block  of  the  chain  of  subgoals. 

•  If  p  appears  as  a  subgoal,  the  p  subgoals’  block  definitions  are  the  same  as  the  head’s 
block  definitions. 

An  elementary  strong  component  is  one  for  which  there  is  a  simultaneous  assignment  of  left 
and  right  blocks  to  all  predicates  in  that  component  that  makes  all  rules  for  those  predicates 
elementary  chain  rules.  An  elementary  program  is  one  that  contains  only  elementary 
components  and  is  in  Join  normal  form. 

Finally,  an  instantiated  elementary  chain  or  instantiated  elementary  chain  rule  is  one 
in  which  the  variables  have  been  instantiated  to  constants.  □ 

Clearly,  we  can  permute  the  arguments  of  p  to  make  the  left  block  precede  the  right 
block,  so  we  assume  this  is  the  case.  We  write  p(X,  Y)  letting  X  represent  the  vector  of 
variables  in  the  left  block  and  Y  the  right  block.  We  can  represent  chains  succinctly  by 
writing  their  arguments  above  and  between  their  predicate  symbols,  like  this: 

U  V  W  X  Y  Z 
go  p  gi  go  <}2 

Here  the  leftmost  subgoal  is  qo{U,V)  and  it  is  followed  by  p{V,W),  etc.  The  left  block  of 
the  whole  chain  is  U  and  the  right  block  is  Z. 

If  the  blocks  are  understood,  we  can  omit  the  variables  without  ambiguity  when  writing 
out  uninstantiated  chains.  It  is  also  apparent  that  we  can  replace  a  p  subgoal  by  a 
chain  of  subgoals  with  fresh  variables  and  appropriate  unifications,  as  illustrated  in  the 
next  example;  this  gives  a  correct  “top-down  expansion”  of  the  rule.  In  other  words, 
elementary  strong  components  can  be  manipulated  much  like  context-free  grammars. 


treating  predicates  in  the  component  as  nonterminals  and  predicates  of  lower  rank  as 
terminals.  We  shall  usually  restrict  attention  to  simple  cases,  in  which  there  is  only  one 
nontrivial  strong  component  and  all  other  predicates  are  EDB  predicates. 

Example  7.2:  The  following  is  an  elementary  chain  rule: 

p{X,Y,Z)  qi{X,U),p(U,V,W)MV,W,Y,Z). 

The  left  block  of  p  consists  of  its  first  argument,  and  its  right  block  consists  of  the  second 
and  third  arguments.  Thus  it  could  be  represented  first  as: 

X  YZ  X  U  VW  YZ 
p  9i  P  72 

and  then  even  more  succinctly  hs  P  —*  qiPqt,  adopting  the  notation  of  productions  in  a 
context  free  grammar.  We  associate  the  predicate  symbol  p  with  the  nonterminal  symbol 
P  in  the  grammar.  If  we  make  a  new  copy  by  subscripting  all  variables  with  1,  then  unify 
its  head  with  p(I/,  V,W)  and  “substitute,”  we  obtain  an  expanded  elementary  chain  rule: 

X  YZ  X  U  Ui  ViWi  VW  YZ 
p  7i  9i  P  92  92 

which  obviously  corresponds  to  the  derivation  P  ^  9i9iP9292*  D 

We  now  consider  several  cases  of  nonlinear  programs  with  an  elementary  chain  rule  as 
the  only  recursive  rule.  These  cases  are  in  join  normal  form,  have  one  IDB  predicate  p, 
and  have  several  EDB  predicates,  90,  9i»  92»  etc.  Although  we  can  write  them  and  think 
about  them  as  binary  predicates,  recall  that  each  variable  symbol  actually  represents  a 
vector  of  variables.  We  organize  the  cases  by  the  number  of  subgoals  in  the  recursive  rule. 
The  first  case  has  two  subgoals. 

Pi  p{X,Y):-p{X,U),p{U,Y). 
p(X,y)  :-9o(X,y). 

This  is  a  version  of  transitive  closure,  well  known  to  be  in  XC.  Note  that  the  left  and 
right  blocks  must  have  the  same  arity  in  cases  like  this  one,  where  there  are  consecutive  p 
subgoals. 

The  next  two  cases  have  three  subgoals. 

p,  p(x,y)  :-p(x,D),p(c/,y),p(y,y). 

p{X,Y)  :-qo{X,Y). 

P3  p(x,y)  :-p(x,t/),9i(t7,y),p(y,y). 

p(X,y):-qo(X,y). 

To  see  that  they  are  both  in  XC,  we  characterize  their  complete  derivation  trees,  and  show 
that  they  have  the  polynomial  fringe  property.  We  associate  the  grammars  G2  with  P j  and 
G3  with  Ps,  where  G2  consists  of  P  -*  PPP  \  go  and  where  G3  consists  of  P  P91P  |  9o- 


Theorem  7.3:  The  Basic  Theorem  problems  of  Pi,  Pj  emd  Ps  are  in  MC. 

Proof :  Because  the  rules  are  elementary  chain  rules,  it  is  sufficient  to  characterize  the 
fringes  of  the  complete  derivation  trees,  which  are  instantiated  elementary  chains.  It  is 
sufficient  to  prove  the  theorem  for  P]  and  Ps,  as  Pi  can  be  simulated  by  P3  with  qi  set 
to  the  diagonal  relation  consisting  of  a  tuple  9i(e,c)  for  every  EDB  constant,  e. 

An  easy  induction  on  the  number  of  rule  applications  shows  that  the  Pj  fringes  are  odd 
length  chains  of  go's,  that  the  P3  fringes  are  chains  of  the  form  {qoqi)*qo,  and  that  all  such 
chains  of  EDB  atoms  are  the  fringes  of  valid  complete  derivation  trees.  Now  suppose  some 
fringe  (of  either  P2  or  P3}  contains  two  occurrences  of  qo  in  an  odd-numbered  position  that 
both  have  the  same  instantiations  of  their  left  block  arguments;  i.e.,  there  is  a  9o(a,6i)  and 
somewhere  to  its  right  there  is  a  90(0,63)  in  the  fringe.  Then  by  removing  90  (a,  61)  and 
the  atoms  to  its  right,  up  to  but  not  including  90(0,63),  we  obtain  a  shorter  chain  that  is  a 
valid  fringe  of  a  derivation  tree  for  the  same  root  atom.  Thus  the  original  derivation  tree 
was  not  minimum.  Therefore,  for  every  atom  in  the  minimum  model,  there  is  a  derivation 
tree  for  that  atom  whose  fringe  does  not  have  such  a  repetition.  Since  the  left  block  of  9b 
can  only  be  instantiated  in  polynomially  many  ways,  it  follows  that  P3  amd  P3  have  the 
polynomial  fringe  property.  | 

The  above  proof,  although  not  difficult,  introduces  a  paradigm  for  showing  that  a 
program  has  the  polynomial  fringe  property: 

(1)  Characterize  the  fringes  that  the  rules  of  the  prograun  can  produce  with  the  help  of 

the  associated  grammar. 

(2)  Show  a  polynomial  bound  such  that  if  the  number  of  occurrences  in  the  fringe  of  an 

instantiated  atom  exceeds  that  bound,  then  that  fringe  is  not  minimum. 

(3)  Conclude  that  every  derivable  atom  has  a  derivation  whose  fringe  length  is  polynomi¬ 

ally  bounded. 

This  paradigm  is  developed  further  in  the  next  section. 

The  programs  examined  so  far  are  only  “weakly  nonlinear”  in  a  certain  sense.  Using 
induction  on  the  number  of  rule  applications,  it  is  easy  to  show  that  the  lineair  programs: 

p;  p(X,y):-9o(X,U),p(U,y). 

piX,Y):-qoiX,Y). 

P',  p(X,y)  ■.-qo{X,UUo{U,V),p{V,Y). 

p{X,Y)  :-9o(X,y). 

Pi  p(X,y)  :-qo{X,U)MU,V),p{V,Y). 

p(x,y)  :-9o(x,y). 

are  equivalent  to  Pi,  P3  and  P3,  respectively,  in  the  sense  of  Definition  7.3.  This  equiva¬ 
lence  also  follows  from  Lemma  8.6. 
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8  Nonlinear  Programs  in  A/C 

We  now  turn  to  programs  that  have  no  linear  equivalent.  When  these  programs  have  a 
close  analogy  with  a  “balanced  parentheses*  language,  we  can  draw  upon  that  analogy  to 
characterize  their  derivation  trees,  and  show  that  they  have  the  polynomial  fringe  property. 
We  “implement”  the  analogy  by  means  of  GSM  mappings. 

Definition  8.1:  Informally,  a  generalized  sequential  machine  (GSM)  is  a  nondeterministic 
finite  automaton  that  emits  an  “output  string*  in  addition  to  consuming  an  input  symbol 
on  each  “move.”  The  output  string  is  normally  over  a  different  alphabet  from  the  input.  It 
is  convenient  to  think  of  a  GSM  in  terms  of  a  graph  in  which  the  nodes  are  states  and  the 
arcs  are  labeled  “a/'y*  where  a  denotes  an  input  symbol  (consumed,  not  e)  and  7  denotes 
an  output  string  (emitted,  possibly  empty).  There  is  one  start  state  and  a  nonempty  set 
of  accepting  states.  If  a  is  an  input  string  to  GSM  M  and  (3  is  the  concatenation  of  the 
output  strings  emitted  by  some  accepting  computation  of  M  on  a,  we  call  /?  a  mapped 
element  of  a.  The  GSM  mapping  of  input  string  a,  written  M{a)  is  the  set  of  mapped 
elements  of  a.  A  GSM  is  e-free  if  each  arc  is  labeled  with  a  nonempty  output  string.  For 
formal  details,  see  [HU79).  □ 

We  now  introduce  some  terminology  for  discussing  strings  associated  with  a  context 
free  grammar.  Let  G  be  a  context  free  grammar  with  nonterminal  alphabet  V  and  terminal 
alphabet  T,  and  let  L  be  the  language  generated  by  G. 

Definition  8.2:  A  symbol  is  an  element  of  V  U  T;  a  string  is  a  sequence  of  symbols.  A 
letter  is  a  terminal  symbol,  i.e.,  an  element  of  T;  a  word  is  a  sequence  of  letters.  Frequently 
we  use  a  superscript,  such  as  a^,  to  designate  a  particular  occurrence  of  symbol  a.  The 
“1”  superscript  does  not  mean  the  first  symbol  of  the  string,  however.  A  nonterminal 
production  is  one  with  a  nonterminal  on  the  right  side;  a  terminal  production  is  one  with 
no  nonterminals  on  the  right  side.  A  sentential  form  of  G  is  a  string  produced  from  the 
stMt  symbol  by  zero  or  more  productions.  A  nonterminal  sentential  form  of  G  is  one  that 
uses  only  nonterminal  productions.  □ 

Definition  8.3:  The  Dyck  language  on  one  kind  of  parentheses,  Di,  is  the  context  free 
language  whose  grammar  is: 


Gd,  S^E\t 

EE\[E}]  tl 

Intuitively,  this  language  consists  of  all  “balanced  parentheses”  words,  including  the  empty 
word,  e.  We  denote  the  language  consisting  of  Di  concatenated  with  an  end-marker  $  by 

£>1$.  □ 


It  turns  out  that  working  with  Di$  avoids  many  technicalities  where  GSM  mappings 
are  concerned;  e.g.,  we  avoid  the  question  of  mapping  the  empty  word. 

To  characterize  sentential  forms  of  Di,  we  need  the  notion  of  “nesting  depth.”  It  is 
convenient  to  label  the  “points”  between  successive  symbols  of  a  string  with  integers.  The 
“point”  to  the  left  of  the  string  is  0;  the  point  following  the  i-th  symbol  is  *;  and  therefore 
the  point  to  the  right  of  a  string  of  length  n  is  n. 

Definition  8.4:  For  strings  over  an  alphabet  containing  C  and  ]  (and  possibly  other 
symbols)  we  assign  a  left-depth  to  each  point  by  starting  with  0  at  point  0,  and  moving 
left  to  right.  We  add  1  whenever  we  cross  an  t,  and  subtract  1  whenever  we  cross  a  ] . 
Similarly,  we  assign  a  right-depth  by  starting  with  0  at  the  point  to  the  right  of  the  string, 
and  moving  to  the  left.  Now  we  add  1  whenever  we  cross  an  ] ,  aoid  subtract  1  whenever 
we  cross  a  [.  For  strings  with  an  equal  number  of  [  and  ]  it  is  apparent  that  the  left-depth 
equals  the  right-depth  at  every  point,  so  we  can  call  the  common  value  the  depth  of  that 
point.  We  shall  restrict  attention  to  such  strings,  and  define  the  depth  of  the  symbols  in 
the  string.  We  define  the  depth  of  a  [  in  a  string  to  be  the  depth  of  the  point  to  its  left, 
and  define  the  depth  of  a  ]  to  be  the  depth  of  the  point  to  its  right.  For  other  symbols, 
the  depth  of  either  adjacent  point  may  be  taken.  □ 

Lemma  8.1:  A  word  over  { [,  ] }  is  in  Di  if  and  only  if  it  has  an  equal  number  of  [  and 
] ,  and  every  letter  has  a  nonnegative  depth. 

Proof :  (This  well-known  fact  is  easier  to  re-prove  than  to  find  a  reference  for.)  The  =► 
direction  is  trivial.  The  <=  direction  is  shown  by  induction  on  k,  where  2k  is  the  word 
length.  The  basis,  k  =  0  or  k  =  1,  is  immediate.  For  k  >  1,  assume  the  ^  direction  for 
words  of  length  less  than  2k. 

In  a  word  w  of  length  2k  in  which  every  letter  has  nonnegative  depth,  find  the  rightmost 
[  that  has  maximum  depth,  and  call  it  [’.  The  must  be  followed  by  a  ]  of  the  same 
depth,  which  we  denote  by  ]*.  The  depth  0  case  is  easy,  so  assume  the  depth  is  d  >  0. 
[*]  *  must  be  followed  by  another  ] .  If  they  are  preceded  by  a  [,  then  let  tu  =  u;i  [  [*]  *]  u>3. 
Clearly  WiCjws  has  all  nonnegative  depths,  so  is  in  Di  by  the  inductive  hypothesis,  from 
which  it  follows  that  WiEws  is  a  sentential  form  of  Gpi  (there  is  no  other  way  to  get  a 
[]  pair).  The  production  E  — »  [E]  shows  that  is  also  a  sentential  form;  hence 

w  E  Di.  If  is  preceded  by  a  ] ,  it  also  has  depth  d;  call  it  ]  *.  By  maximality  of  d  there 
is  a  [*  immediately  to  the  left  of  ]*.  Let  w  =  The  remainder  is  similar  to 

the  previous  case,  except  that  the  production  E  —*■  EE  is  used.  | 

Theorem  8.2:  (GSM  Mapping  Theorem)  Let  P  be  a  basic  logic  program  with  a 
distinguished  IDB  predicate  p,  other  IDB  predicates,  pi,P3, . . .,  amd  several  EDB  predicates 
of  which  p  depends  upon.  Let  P  have  only  elementary  chain  rules.  Let  G 
be  the  context  free  grammar  obtained  by  considering  P  as  the  start  symbol,  considering 
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P, Px.P],...  as  nonterminals,  and  considering  91,92,...  as  terminals.  The  productions  of 
G  correspond  to  the  rules  of  P  in  the  obvious  way.  That  is,  the  right  side  of  a  production 
corresponds  to  the  chain  of  subgoals  of  a  rule,  and  P,  Pi,  ...  correspond  to  p,  pi, 
respectively.  Let  L  be  the  language  generated  by  G.  If  L  is  the  GSM  mapping  of  Di$  for 
some  GSM  M,  then  P  has  the  polynomial  fringe  property,  hence  is  in  MC. 

Proof:  Let  F  be  the  fringe  of  a  minimum  complete  derivation  tree  of  P,  expressed  as 
am  elementary  chain,  for  some  atom  a  in  the  minimum  model.  By  ‘‘minimum”  we  mean 
minimum  fringe  length.  Let  F  be  the  corresponding  word  obtained  by  using  just  the 

A 

predicate  symbols  of  F.  Clearly  F  E  L.  Let  a  6  Di$  be  a  minimum  length  word  in 
M-^(P).  That  is,  F  G  M(a).  We  note  that  !P|  =  |P|  =  0(la|). 

From  now  on  we  consider  a  particular  computation  of  M  on  a  that  produces  F.  Let 
si, . . . ,  Sr  be  the  sequence  of  states  of  M  in  this  computation,  excluding  the  final  (accepting) 
state.  We  partition  F  into  subwords  {P<  [  Fi  =  Af (a,),  t  =  1, 2, . . . ,  r},  where  oti  •  •  •  a,  are 
the  letters  of  a.  We  correspondingly  partition  F  into  Pi,...,Pr.  Let  6<  be  the  left  block 
of  Fi,  that  is,  6,  is  the  vector  of  EDB  constants  whose  components  are  the  instantiations 
of  variables  that  occur  in  both  the  first  subgoai  of  Fi  2md  the  last  subgoal  of  Fi-i  (or  in 
the  head,  if  t  =  1).  We  define  an  ID  for  this  computation  of  M  on  a  that  outputs  F  as 
the  triple  ID{i)  =  {oti,Si,bi),  where; 

•  a,  is  the  next  symbol  to  be  read  by  M. 

•  Si  is  the  state  of  M  just  before  reading  Oi. 

•  6i  is  the  left  block  of  p. 

Furthermore,  let  d,  denote  the  depth  of  a,  in  a,  as  defined  in  Def.  8.4.  The  relationships 
are  illustrated  in  Fig.  2.  Note  that  the  depth  for  each  [  is  associated  with  the  “point”  to 
its  left,  and  the  depth  for  each  ]  is  associated  with  the  “point”  to  its  right. 

In  connection  with  one  computation  of  M  we  get  a  sequence  of  ID’s:  ID{1),. . . ,  ID{r), 
and  a  sequence  of  nonnegative  depths  0  =  di, . . . , d,  =  0- 
C/aim  1: 

For  1  <  i  <  j  <  r,  if  di  =  dy,  then  ID(i)  ^  ID{j). 

Proof  of  Claim  1: 

Suppose  there  are  distinct  positions  in  a,  say  i  <  j,  such  that  dj  =  d,  and  /£>(»)  =  ID{j). 
From  the  definition  of  depth  and  the  fact  that  =  a,  ,  it  follows  that  there  are  an  equal 
number  of  [  and  ]  in  the  subword  a,  •  •  •  ay_j.  It  follows  immediately  that  a'  G  Di%. 

Now  we  remove  the  subchain  p  through  Py_i  from  the  original  fringe  F.  What  remains 
is  still  an  elementary  chain,  as  now  6,,  which  equals  bi,  connects  properly  to  p_i  (or  to 
the  left  block  of  the  head,  if  i  =  1).  Also  we  remove  the  subword  p  through  Fj-i  from 
F,  and  we  remove  a,  •  •  •  q,_i  from  a,  giving  words  F'  and  o'.  The  situation  is  depicted  in 


9i(a,6),  gi(6,c),  qo{e,d). 


9j(®,y).  9o(y,«) 


a=  C  [  ]  [  C  ] 


di  =  0  1  1  1  2  2  1 


110  0 


Figure  2:  Derivation  fringe  for  p(a,  z),  ID  of  a  GSM  computation,  and  nesting  depth. 

Since  a,-  =  Sj,  M  was  in  the  same  state  before  reading  ot,-  as  before  reading  ay.  Abo, 
a,-  =  Oj  =  aj.  Thus  the  transition  a,-  —*■  ay+i  consuming  ay  and  emitting  Fj  b  legal  for  M. 
Define  a  computation  of  Af  on  a*  that  mimics  the  computation  on  a  except  on  a<  ”  •  ay.i. 
When  a[  is  reached,  make  the  transition  a,-  — »  ay+i  consuming  a|-  and  emitting  Thb 
computation  is  legal  for  M  operating  on  a',  so  F'  6  M(Di$).  Moreover,  by  the  minimality 
of  a  for  F,  we  have  that  ^  b  a  proper  subword  of  F. 

That  b,  |F'|  <  |F|  and  F'  is  the  fringe  of  a  complete  derivation  tree  for  the  same  atom 
a.  Thb  contradicts  the  assumption  that  the  original  derivation  tree  was  minimum.  It 
follows  that  ID{i)  ^  ID{j)  for  any  distinct  a,-  and  ay  at  the  same  depth.  □ 

We  next  show  that  the  depth  of  any  letter  in  a  b  polynomially  bounded  whenever  F 
corresponds  to  a  minimum  derivation.  The  argument  is  a  more  complicated  version  of  the 
one  that  showed  that  ID's  at  the  same  depth  must  be  distinct. 

Claim  2: 

Let  dmax  be  the  maximum  depth  of  any  letter  in  a.  Then  dmax  b  polynomially  bounded. 
Proof  of  Claim  2: 

Let  m,  1  <  m  <  r  be  a  position  of  maximum  depth;  i.e.,  o,^  has  depth  dmax-  Select  a 
subsequence  (io, . . . ,  of  (1, . . . ,  m)  such  that: 

•  djj,  the  depth  of  a,*,  equals  k. 

•  dj  >  fc  for  all  t  such  that  u  <  »  <  m. 

We  call  this  the  left  subsequence.  Clearly  lD{ik)  b  defined  for  each  k,  0  <  k  <  dmax 
(*<*«.«  =  Similarly,  select  a  right  subsequence  (joi •  •  •  >  Jd*..)  of  (r,r  -  1,. . .  ,m)  such 
that: 

•  dy.,  the  depth  of  ay*,  equals  k. 


Figure  3:  Illustration  of  Claim  1.  IDs  cannot  repeat  at  same  level. 

•  dj  >  k  for  all  j  such  that  m<  j  < 

Again  ID{jit)  is  defined  for  each  k,0  <  k  <  dmas-  Finally,  for  each  depth  0  <  d<  dmat  we 
form  the  pair  Z4  =  {ID{ii),ID{ji)). 

Suppose  there  are  two  distinct  integers,  0  <  d  <  e  <  d^ag,  such  that  =  Z,.  Let  us 
simplify  subscripting  by  setting  *  =  j  =  t*,  k  =  jt,  and  /  =  jj.  Note  that  i  <  j  <  k  <  1. 
We  now  remove  two  subwords  of  a,  namely  a,-  •  •  •  ay_i  and  a*  •  •  •  a/-i,  obtaining  a  shorter 
word  a',  a!  clearly  has  an  equal  number  of  [’s  and  ] ’s,  so  “depth"  is  defined.  The  main 
point  is  that  the  depth  in  a  of  all  letters  in  a,-  ■  ■  ■  ai^-i  is  at  least  e,  so  that  the  depths  of 
corresponding  letters  in  a'  are  at  least  d,  hence  nonnegative.  Th\is  a'  is  also  in  Di$.  We 
also  define  P  and  F'  by  removing  subwords  and  subchains  corresponding  to  the  letters 
removed  from  a.  The  situation  is  depicted  in  Fig.  4. 

A 

It  is  straightforward  that  Af  maps  a'  into  F’  with  an  au:cepting  computation,  and  that 
F'  is  the  fringe  of  a  complete  derivation  tree  that  derives  the  saune  atom  as  F  derived.  The 
details  are  similar  to  the  argument  for  Claim  1.  Since  a  was  minimal  for  F,  F'  is  strictly 
shorter  than  F.  This  contradicts  the  assumption  that  F  was  minimum;  therefore  Z^  and 


(a,',s,',6j)  —  {aj,Sj,bj) 


b/  b/  b/  b/  / 


Figure  4:  IlluBtration  for  Claim  2.  Certain  paira  of  IDs  cannot  repeat.  Dotted  curve 
denotes  positions  not  in  left  or  right  subsequence. 

Zt  must  have  distinct  values  whenever  d  ^  e.  There  are  only  polynomially  many  possible 
values  for  Zj,  so  dmax  is  polynomially  bounded.  □ 

To  summarize,  the  depth  of  a  is  polynomial,  and  the  same  ID  cannot  occur  twice  at 
the  same  depth,  thereby  bounding  the  number  of  symbols  of  a  at  each  depth,  so  the  length 
of  a  is  polynomially  bounded.  Since  |F|  =  0(|a|),  the  theorem  follows.  | 

A  sequential  transducer  is  essentially  a  GSM  that  can  also  madie  transitions  and  produce 
output  without  consuming  an  input  symbol;  such  transitions  are  called  e-moves.’  We 
remark  that  this  generalization  gains  us  nothing,  as  sequential  transducer  mappings  of 
Di$  give  the  saime  family  of  languages  as  GSM  mappings  of  Di$.  To  see  this,  suppose  that 
w  is  a  possible  output  string  of  sequential  transducer  T  operating  on  input  a  €  Di$,  where 
o  is  written  with  e’s  inserted  wherever  the  sequential  transducer  will  make  an  e-move  in 
its  computation.  Form  ')  by  replacing  each  [  in  a  by  t  [,  replacing  each  ]  in  a  by  ]  ] ,  and 
replacing  each  e  in  a  by  [] .  A  GSM  is  easily  constructed  (not  dependent  on  a,  of  course) 

^These  moves  are  not  to  be  confused  with  moves  that  consume  an  input  symbol,  but  output  nothing. 
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that  imitates  T  to  produce  u;  as  a  computation  on  7. 

As  an  application  of  the  theorem  on  GSM  mappings  of  D\%  we  consider: 

P4  p{X,Y)  ■.-qi{X,U),p(U,V),q^{V,W),p[W,Y). 
p[X,Y)  :-qo{X,Y). 

Let  G4  and  L4  be  the  grammar  and  language  corresponding  to  P4.  G4,  given  by  P  -+ 
q\Pq2P  1  9o-  We  observe  that  all  words  in  L4  have  an  equal  number  of  ^t’s  zmd  qj's,  and 
that  for  any  fc  >  0,  words  of  the  form  91(^092)*  9i(9o92)‘  •••  9* (9092)*  9o  are  in  L4.  It 
appears  that  all  such  words  cannot  be  recognized  by  any  Snite-turn  push-down  automaton, 
as  defined  in  [GS66,HU79],  hence  are  not  in  a  linear  context-free  language.  We  therefore 
do  not  expect  to  find  an  equivalent  linear  program  for  P4,  as  we  did  for  Pi,  Pj,  and  Ps. 

To  apply  the  GSM  mapping  theorem,  we  define  a  GSM  M4  as  shown  in  Fig.  5.  The 
arc  label  “a/7”  means  “consume  input  symbol  a  and  output  string  7. 


Figure  5:  The  GSM  A/4,  with  start  state  0  and  accepting  state  1. 

Lemma  8.3:  A  word  w  of  90,  91,  and  92  symbols  is  in  L4  if  and  only  U  w  E  A/4(Z?i$). 

Proof :  (=>)  Proceed  by  induction  on  fc,  the  number  of  applications  of  the  nonterminal 
production  of  G4.  Note  that  lii;|  =  3fc  -I-  1.  For  fc  =  0,  A/4($)  =  {90}.  For  fc  >  0, 
assume  the  lemma  is  true  in  the  =>  direction  for  derivations  with  less  than  fc  nonterminal 
production  applications.  The  first  production  of  a  fc  step  derivation  of  w  replaces  the 
starting  P  symbol,  say  P°,  by  qiP^qzP^.  Then  and  P*  have  derivations  of  fewer  than 
fc  nonterminal  steps  that  yield  10'  and  such  that  w  =  qiW^q2W^.  By  the  inductive 
hypothesis,  there  exist  a  and  /?  in  Z?i$  that  are  mapped  by  A/4  into  and  u/*.  Let  a'  be 
a  with  its  $  end-marker  removea  and  define 


It  is  clear  that  ^  G  D\%.  Now  we  claim  that  The  only  change  ia  that 

M4  emits  the  go  at  the  end  of  (as  part  of  90^2)  when  it  reads  the  “new”  ]  of  <7  instead 
of  when  it  reads  the  $  of  a. 

(•«=)  Proceed  by  induction  on  the  length  2fe  +  1  of  the  word  in  Di%.  The  base  case, 
jb  =  0,  is  trivial.  For  k  >  0,  assume  the  lemma  holds  in  the  <=  direction  for  shorter  words, 
which  are  of  the  form  2ik'  +  1  for  some  0  <  <  k.  In  a  word  7  of  length  2A:  +  1  find 

the  rightmost  [  letter  that  has  depth  0,  and  call  it  [^.  This  letter  must  either  be  the  first 
letter  of  7,  or  mu&t  be  preceded  by  a  ]  of  depth  0.  There  is  precisely  one  ]  of  depth  0  to 
the  right  of  [^,  and  it  is  immediately  followed  the  the  $  end-marker;  call  this  letter  ]^. 

If  is  not  the  first  letter  of  7,  partition  7  into  a'/?,  where  consists  of  and  all 
following  letters.  Let  a  be  a'  with  $  appended.  It  is  clear  that  both  a  and  0  ar.'  in 
i7i$.  Let  M|(a)  =  and  =  {w^}.  (The  mappings  are  singleton  sets,  since 

M4  is  deterministic.)  By  the  inductive  hypothesis,  and  w’  are  in  L4.  But  then  so  is 
w  —  qiw^qjW^,  and  again  it  is  easy  to  see  that  —  {w}. 

If  is  the  first  letter  of  7,  partition  7  into  [^a']^$.  By  definition  of  [^,  o'  (as  a  subword 
of  7)  contains  no  letters  of  depth  0.  Consequently,  o'  (as  a  word)  is  in  Di.  Let  a  be  o' 
with  $  appended;  a  G  Di%.  Let  M^{o)  =  {w*}.  By  the  inductive  hypothesis,  is  in  L4. 
But  then  so  is  u;  =  and  again  it  is  easy  to  see  that  ^4(7)  =  {w}.  | 

Corollary  8.4:  P4  has  the  polynomial  fringe  property,  hence  is  in  UC. 


Corollary  8.5:  The  following  prograun  is  in  UC'. 


p{X,  Y)  9,  (X,  U),p{U,W),  p(W,  Y ) . 
p(X,y)  :-9o(X,r). 


Proof :  For  each  EDB  constant  c  add  an  atom  9i(c,c)  to  the  EDB,  then  run  the  algorithm 
on  P4.  I 

For  our  next  application  of  the  GSM  Mapping  Theorem,  we  show  that  Ps.0.2  can  be 
generalized  to  a  family  of  elementary  single  rule  programs  with  only  one  nonrecursive 
subgoal  that  are  in  XC.®  The  next  program  represents  a  family  of  programs  indexed  by  i 
and  j,  where  0  <  1  <  j,  and  j  >  0.  The  notation  denotes  i  occurrences  of  P. 

Psij  p(x,y) p(x,c;i),p(i^i,t^,) . p{Ui,v), 

<7i(v",v^i),p(vyi,iv,),p(iy„vy,),...,p(iv„y). 

p(X,y);-  9o(x,y). 

which  is  succinctly  expressed  by  the  grammar  Gs.,-.,  consisting  of  P  — ►  P^'^qiP^^^  |  qo-  The 
complexity  of  Ps.i.;  in  the  general  case  is  open.  However,  we  can  show  that  the  cases  i  =  0 

®Pure  recursions,  P  —*  P  ■  ■  ■  P  P  \  qo,  are  obviously  in  MC,  being  equivalent  to  P  —*  qo  ■  qo  P  \  qo- 


and  1  =  1  are  in  }JC  for  all  j.  The  case  t  =  1  relies  on  the  following  lemma  to  show  that 
it  is  essentially  the  same  as  the  case  t  =  0.  It  is  immediate  that  two  elementary  programs 
are  equivalent  in  the  sense  of  Definition  7.3  if  their  corresponding  grammars  generate  the 
same  language. 

Lemma  8.6:  Let  be  a  grammar  with  nonterminal  symbol  P  in  which  the  only  produc¬ 
tions  for  P  are: 

P-^P'lP\  90 

Here  go  is  a  terminal  and  7  is  a  string  possibly  containing  both  terminals  and  nonterminals. 
Then  L,  the  language  generated  by  G,  is  also  generated  by  G',  where  G'  is  obtained  from 
G  by  replacing  the  productions  for  P  by: 

P  qo'lP  1  9o 

Proof :  We  observe  that  the  derivation  in  G: 

P  =»  P^P  ^  P^P^P 

is  ambiguous,  in  that  the  third  string  can  be  derived  by  replacing  either  the  first  or  the 
last  P  in  P7P;  consequently,  when  parsing  a  string,  we  can  assume  that  such  derivations 
are  always  obtained  by  replacing  the  last  P.  It  follows  that  any  word  in  L  has  a  derivation 
in  which  the  P  immediately  to  the  left  of  7  is  always  replaced  by  90,  and  never  by  P7P. 
In  other  words  L  is  also  generated  by  the  grammar  G'.  | 

To  apply  the  GSM  mapping  theorem  in  the  cases  t  =  0  and  t  =  1,  we  define  Ms.,-.,-  as 
shown  in  Fig.  6.  Observe  that  Ms.,-.,  uses  more  of  the  power  of  the  GSM  Mapping  Theorem 
than  M4  in  that  it  accepts  only  a  subset  of  Di%,  yet  it  does  not  use  the  full  power,  since 
it  is  deterministic. 

Lemma  8.7:  Let  i  =  0  or  t  =  1.  A  word  w  of  90, and  91  symbols  is  in  Ls.*.,-  if  and  only  if 
w  €  Af5.,-.j(Z?i$). 

Proof :  It  is  sufficient  to  prove  the  case  *  =  0,  for  then  the  case  i  =  1  follows  by  Lemma  8.6. 

{=>)  Proceed  by  induction  on  k,  the  number  of  applications  of  the  nonterminal  pro¬ 
duction  of  Gb.o.j-  Note  that  k  is  the  number  of  91  symbols  in  w,  and  |u;|  =  (*  +  j)k  +  1. 
For  k  =  0,  M6,o.>($)  =  {90}-  For  k  >  0,  assume  the  lemma  is  true  in  the  =>  direction 
for  derivations  with  less  than  k  nonterminal  production  applications.  Let  the  rightmost 
occurrence  of  91  in  w  be  in  Wm-  Let  P  =»  u;  be  a  leftmost  derivation,  i.e.,  at  each  step 
the  leftmost  nonterminal  symbol  is  replaced  by  the  right  hand  side  of  a  production.  Then 
the  last  use  of  P  — +  91 P^^^  in  this  leftmost  derivation  produces  the  occurrence  of  91  in 
Wm-  There  are  at  lezist  j  occurrences  of  90  after  u;^.  Thus  the  substring  Wm“'V)m+j 
contains  9i9o^^  We  define  w'  by  replacing  this  substring  of  u;  by  a  single  90.  Clearly 


Figure  6:  The  GSM  with  start  state  0  and  accepting  state  j-l,  where  i  =  0  or  »  =  1. 

w'  €  and  the  inductive  hypothesis  applies  to  show  that  there  is  some  o'  €  such 
that  itf'  €  Partition  a'  into  OiOtj  such  that  Ms.o.y  emits  in  response  to  the 

first  symbol  of  aj.  Thus  Oj  contains  only  ]’s.  Now  form  a  by  inserting  j  —  1  [’s  followed 
hy  J  —  1  ] ’s  between  a'j  and  o',.  Then  xu  G  Mt.o.j(a). 

(«=)  Proceed  by  induction  on  the  length,  2k{j  - 1)  + 1  of  the  word  a  in  Z?i$.  Note  that 
the  [’s  of  a  must  form  k  groups  of  j  -  1  each,  or  else  Afs.o.y  will  not  accept  a.  The  base 
case.  A:  =  0,  is  trivial.  For  k  >  0,  assume  the  lemma  holds  in  the  <=  direction  for  shorter 
words.  In  a  word  a  of  length  2k(j  —  1)  +  1  find  the  rightmost  group  of  j  —  1  [’s;  they  must 
be  followed  by  j  —  1  ]’s.  Remove  these  2(j  —  1)  letters,  creating  a'.*  Clewly  a*  G  Di$ 
and  is  accepted  by  Ms.o.y.  Let  w'  be  the  string  emitted  by  Ms.o.y  operating  on  a',  and  let 
be  the  symbol  emitted  in  response  to  the  letter  immediately  after  the  removed  letters. 
V)'  €  I-s.o.y  hy  the  inductive  hypothesis,  w'^  must  be  a  qo  because  there  are  no  [’s  further 
to  the  right  in  ad.  In  the  derivation  tree  for  w'  replace  the  derivation  for  u;{„,  which  was 
P  =»  qo,  by  the  derivation  P  =>  qiP^’^;  then  replace  these  new  P’s  by  go’s,  giving  a  new 
string  w.  Clearly,  Ms.o.y(a)  =  {tu}  C  Ls.o.y-  I 

Theorem  8.8:  Let  t  =  0  or  t  =  1.  Pst.y  has  the  polynomial  fringe  property,  hence  is  in 

MC. _ 

*There  may  be  more  ] ’s  to  the  right  of  the  removed  letters. 


Proof:  Immediate  from  Lemma  8.7  and  the  GSM  Mapping  Theorem.  | 

We  obtain  the  “mirror  images*  Pf ,  Pfo.,  Pf  i.y  by  reversing  the  subgoal  chains  in 

the  obvious  manner.  They  eure  also  in  )fC,  therefore. 

In  Section  9,  Definition  9.1,  we  define  a  recursive  subgoal  in  an  elementary  chain  rule 
to  be  cut  off  if  it  not  at  either  end  of  the  subgoal  chain  and  is  either  partially  or  completely 
surrounded  by  nonrecursive  subgoals.  Cut-off  recursive  subgoals  play  an  important  part  in 
simulating  Boolean  circuits  for  P -completeness  proofs,  apparently  because  such  subgoals 
have  no  variables  in  common  with  the  head  of  the  rule.  In  that  section  we  conjecture  that 
any  elementary  single  rule  program  in  which  the  recursive  rule  has  two  cut  off  recursive 
subgoals  is  P-complete. 

Recall  that  elementary  programs,  being  in  join  normal  form,  cannot  have  two  consec¬ 
utive  nonrecursive  subgoals;  consequently,  any  rule  with  k  nonrecursive  subgoals  has  at 
least  k  —  1  cut  off  recursive  subgoals,  and  may  have  more. 

Based  on  our  conjecture,  and  assuming  MC  ^  the  following  program  appears  to  be 
the  longest  elementary  single  rule  program  that  is  in  A/ C  and  whose  recursive  rule  contains 
two  EDB  subgoals. 

Pe  p{X,Y)  :-piX,Tlq^iT,Ulp{U,V),qi{V,W),p{W,Y). 
p{X,Y)  :-qoiX,Y). 

Let  Ge  and  L«  be  the  grammar  and  language  corresponding  to  Pe.  Thus  Ge  is  given  by 
P  — ►  PqiPqtP  I  qo.  That  Pe  is  in  MC  is  immediate  from  Lemma  8.6  and  the  fact  that 
P4  is  in  MC, 

This  is  about  as  fax  as  we  can  go  with  elementary  single  rule  programs  in  MC.  We 
shall  see  in  the  next  section  that  several  ways  of  extending  such  rules  with  more  subgoals 
(remaining  in  join  normal  form)  result  in  programs  that  are  ^-complete.  We  conjecture 
that  this  is  always  the  case. 

However,  the  GSM  Mapping  Theorem  applies  to  programs  with  more  than  one  recursive 
rule,  provided  each  that  rule  is  an  elementary  chain  rule. 

Example  8.1:  In  this  example  a  single  nonelementary  chain  rule  is  first  transformed  into 
two  elementary  chain  rules,  then  the  theorem  is  applied. 

Pt  piX,Y)  :-p(A',K),fl-,(K,W),p(r,W). 
p{X,Y):-qo{X,Y). 

It  is  convenient  to  identify  p^(y, X)  with  p{X,Y),  q§{Y,X)  with  go(X,Y'),  and  qf(y,X) 
with  qi(X,y).  This  leads  to  the  elementary  chain  rules: 

p;  p(x,y)  :-p(x,y),g,(y,w),p«(w,y). 
p{X,y)  :-9o(X,y). 

p«(x,y):-p(x,y),<7f(y,w),p«(w,y). 
p«(X,y)  :-q^{X,Y). 


Let  Gj  and  Lj  be  the  grammar  and  language  corresponding  to  P7.  Thus  G7  is  given  by 
the  following  productions  (P  is  the  start  symbol): 

P  -»  PftP*  1  go 
P*  PgfP*  I  g? 

It  is  easy  to  show  that  Lj  consists  of  certain  odd-length  words  with  go  or  g^  in  all  the 
odd  positions,  and  gi  or  gf  in  the  even  positions.  The  first  position  must  contain  qo. 
Let  us  interpret  every  symbol  after  the  first  as  a  left  pairenthesis  if  it  does  not  have 
superscript  R  and  as  a  right  parenthesis  if  it  does  have  superscript  R.  The  specification 
of  Lj  is  completed  by  requiring  that  the  symbols  after  the  first,  thus  interpreted,  comprise 
a  "balanced  pairentheses”  word.  The  proof  that  this  characterizes  L7  is  a  straightforward 
induction  in  each  direction,  and  is  omitted. 

To  apply  the  GSM  mapping  theorem,  we  define  Mr  as  shown  in  Fig.  7 
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Figure  7:  The  GSM  Mj,  with  start  state  0  and  accepting  state  3. 


Clearly,  M7{D\%)  implements  the  chairacterization  of  Lt  given  above.  Consequently 
(subject  to  the  proof  of  that  characterization),  P7,  and  hence  P7,  are  in  kC.  □ 

Unfortunately,  the  idea  of  replacing  nonelementary  chains  by  a  set  of  elementary  chains 
does  not  generalize  to  (truly)  tertiary  predicates,  at  least  as  far  as  we  can  see. 


Example  8.2:  Consider  the  programi: 

Ps  p{X,Y,Z)  :-p{X,U,V)MU,V,W),p{Y,W,Z). 
p(X,Y,Z)  :-qo(X,Y,Z). 


We  can  define  Pi, . . .  ,P6  to  be  p  under  the  various  permutations  of  its  arguments,  but  we 
cannot  maintain  the  left  block  as  a  single  argument  in  the  resulting  rules.  Consequently 
some  “substitutions”  are  incompatible,  and  the  CFG  analogy  breaks  down.  □ 

9  P-Complete  Logical  Query  Programs 

In  this  section  we  show  ^-completeness  for  a  number  of  logical  query  programs  that  are  in 
some  sense  just  beyond  the  boundaries  of  the  ones  we  have  already  considered.  Assuming 
that  UC  ^  P t  this  indicates  that  some  of  the  “templates”  for  logical  query  programs 
of  the  type  we  are  considering  cannot  be  further  generalized,  at  least  not  in  obvious  ways. 
Other  formulations  of  logical  query  programs  are  possible,  of  course. 

In  particular,  Cosmadakis  and  Kanellakis  [CK85]  have  studied  formulations  with  more 
of  a  relational  database  “flavor,”  for  which  they  coined  the  name  “sirup”  (for  Single  RUle 
Program) .  For  example,  they  have  obtained  some  results  for  typed  rules,  which  are  defined 
in  analogy  with  typed  tableaux.  E.g.,  if  U  appears  in  the  first  argument  of  one  occurrence 
of  p,  and  appears  in  the  second  argument  of  some  other  occurrence  of  p  in  the  same  rule, 
then  that  rule  is  not  typed. 


9.1  P-Complete  Elementary  Single  Rule  Programs 

Elementary  chain  rules,  as  remarked  before,  have  a  close  analogy  with  context  free  gram¬ 
mars.  The  GSM  Mapping  Theorem  states  that  GSM  mappings  of  Di$  are  in  MC.  Yet  it  is 
known  that  any  CFL  can  be  represented  as  a  GSM  mapping  of  I?2,  the  Dyck  language  on 
2  kinds  of  parentheses  [Gre73].  It  is  instructive  therefore  to  demonstrate  that  elementary 
chain  programs  can  be  P -complete. 

We  now  consider  the  smallest  (in  total  literal  count)  P-complete  elementary  single  rule 
program: 

P„  p(A:,y)  :-gi{X,U),p(U,V),p(V,W),gi{W,Y). 
p(X,V}  :-go(X,y). 

The  proof  that  Pn®  is  P-complete  provides  a  paradigm  that  can  be  used  for  many 
programs.  Observe  that  the  subgoal  chain  has  two  occurrences  of  p  that  are  ”cut  off” 
from  the  end  of  the  chain  by  91  atoms.  The  basic  idea  is  that  when  the  rule  is  instantiated, 
the  head  ”  represents”  the  output  of  a  particular  gate  of  a  circuit,  and  the  two  recursive 
subgoals  ” represent”  its  inputs.  We  want  to  simulate  the  circuit  by  setting  up  an  EDB 
based  on  the  circuit  that  has  this  property  for  each  gate  g  in  the  circuit:  p{g0,g9)  is  true 
(derivable)  if  and  only  if  gate  g  outputs  a  1. 

A  useful  intuition  is  to  interpret  p{g0,g9)  also  as  meaning  there  is  an  electrical  path 
from  gO  to  g9  allowing  current  to  flow.  If  g  is  an  and  gate,  we  want  the  only  possibilty  to 


°Our  numbering  of  programs  has  some  gaps 


be  a  “series”  path  that  goes  through  one  input,  returns  to  a  point  associated  with  g  {gA  in 
the  case  of  Pn),  and  then  goes  through  the  other  input.  Thus  current  can  flow  through 
g  if  and  only  if  it  can  flow  through  both  of  g's  inputs.  For  an  or  gate  we  want  “parallel” 
paths,  so  that  current  can  flow  through  g  if  and  only  if  it  can  flow  through  at  least  one  of 
its  inputs. 

We  now  describe  a  subclass  of  monotone  Boolean  circuits  that  can  be  simulated  by  Pn. 
First,  let  us  designate  the  class  of  all  monotone  Boolean  circuits  by  C,  and  the  subclass  of 
monotone  Booleain  circuits  with  only  two  inputs,  which  can  be  designated  left  and  right 
su-bitrarily,  by  Ci.  It  is  well  known  that  any  circuit  in  C  caji  be  transformed  in  log  space 
to  an  equivalent  circuit  in  Ci. 

We  designate  our  special  subclass  by  Cs*  The  binary  circuit  elements  for  Ci  are  and 
and  or  gates  with  precisely  two  distinguished  inputs,  called  the  left  input  and  right  input, 
and  one  or  two  distinguished  outputs,  a  left  output,  or  a  right  output,  or  both.  In  our 
terminology  an  output  of  one  gate  connects  as  input  to  precisely  one  other  gate;  “fan  out” 
is  achieved  by  having  multiple  outputs  on  a  gate.  In  addition,  there  are  source  elements 
with  no  inputs  and  one  or  two  outputs  that  emit  logical  Os  or  Is,  called  0  and  1  gates 
appropriately.  Their  outputs  are  also  distinguished  as  left  or  right.  Finally,  for  a  circuit 
to  be  in  Cj,  we  require  that  only  left  outputs  be  connected  to  left  inputs  and  only  right 
outputs  be  connected  to  right  inputs. 

Lemma  9.1:  There  is  a  log  space  transformation  from  C  to  Ct  that  preserves  the  output 
values  of  circuits. 

Proof :  Suppose  some  element  y  in  a  Ci  circuit  has  outputs  that  are  connected  to  m  >  1 
left  inputs.  Let  the  left  and  right  inputs  to  g  be  ei  and  Cr,  respectively.  Transform  the 
circuit  by  introducing  an  additional  and  gate  y'  and  an  additional  1  gate  yj.  Let  gate  h 
be  one  of  the  gates  connected  to  an  output  of  y.  Connect  m  —  1  outputs  from  y'  to  all  of 
the  elements  that  y  was  connected  to,  except  h,  and  remove  those  y  connections;  designate 
the  y  output  that  goes  to  h  as  the  left  output  of  y,  and  give  y  a  right  output  that  connects 
to  the  right  input  of  y'.  Finally,  connect  the  left  output  of  yi  to  the  left  input  of  y'.  It 
is  evident  that  this  produces  an  equivalent  circuit,  amd  that  the  transformation  can  be 
repeatly  carried  out  in  log  space  until  no  gate  has  outputs  connected  to  more  than  one  left 
input.  A  corresponding  transformation  can  remove  outputs  connected  to  multiple  right 
inputs.  I 

Therefore,  we  are  justified  in  calling  a  basic  logic  program  F-complete  if  we  can  use 
it  to  simulate  circuits  in  Cj-  More  precisely,  we  need  a  log  space  transformation  from  Ct 
circuits  into  EDB  relations  and  a  particular  “theorem”  to  decide,  such  that  the  theorem 
holds  in  the  extended  logic  program  if  and  only  if  the  circuit  outputs  a  1. 

We  now  demonstrate  such  a  transformation,  Tu,  for  Pu.  This  transformation  forms 
EDB  constants  by  concatenating  the  “name”  of  a  gate  with  a  digit  0-9,  which  designates 


the  "purpose”  of  that  constant.  For  example,  if  e  is  the  name  of  a  gate,  constants  eO  and 
e9,  among  others,  will  be  created.  One  "purpose”  of  these  constants  is  that  p(eO,  e9)  should 
be  true  if  and  only  if  gate  e  outputs  a  1  in  the  circuit.  In  particular,  if  e  is  a  1  gate,  then 
Til  will  put  go(eO,e9)  into  the  EDB. 

First  we  shall  informally  explain  the  principle  upon  which  Tu  is  based.  Expand  the 
recursive  rule  by  substituting  once  for  each  recursive  subgoal.  In  terms  of  a  derivation,  we 
have 

P  =>  qiPPqi  =»  qi  qippqi  qippqi  qi 

Then  consider  an  and  gate  e  with  left  input  /  and  right  input  g,  and  instantiate  the 
expanded  rule  as  follows  (indentation  is  only  to  highlight  relevant  phrases): 

p(e0,e9)  fi(eO,el), 

(el,  /O) ,  p(/0,  /9) ,  p(/9,  /2) ,  qi  (/2,  e4) , 

9i  (e4,  Sf7) ,  p(g7,  ffO) ,  p(gO,  g9) ,  qi  (g9,  e8) , 

9i(e8,e9). 

We  next  replace  p  subgoals  whose  arguments  are  not  of  the  form  (sO,  x9)  by  corresponding 
qo  subgoals,  and  represent  the  result  more  succinctly  as: 

eO  «1  fO  /9  /2  e4  gl  gO  g9  e8  e9 

9i  71  P  70  7i  7i  70  P  7i  7i 

We  want  p(c0,e9)  to  be  derivable  if  and  only  if  both  p(/0,/9)  and  p(90,gr9)  are 
derivable.  Tu  will  put  the  following  atoms  into  the  EDB  to  make  this  come  about: 

•  gi(el,/0)  and  7i(/2,c4)  because  e  has  left  input  /; 

•  7i(<4,p7)  and  7i(p9,e8)  because  e  has  right  input  g\ 

•  9i(e0,el)  amd  9i(e8,e9)  because  e  has  inputs; 

•  7o(/9, /2),  7o(/7, /O),  70(79,72),  and  70(77,70)  because  the  outputs  of  /  and  7  are 
used. 

The  procedure  for  an  or  gate  e  with  left  input  /  and  right  input  7  is  obtained  by 
modifying  the  and  procedure.  (We  eschew  a  shortcut  that  is  possible  in  this  case,  in  order 
to  demonstrate  a  technique  that  is  generally  applicable.)  Expand  the  rule  as  indicated  by 
the  derivation: 

P  =»  qiPPqi  =>  7i  qiPPqi  Pqi 

Instantiate  the  expanded  rule  twice  and  replace  certain  P’s  by  70's  as  before.  The  first 
expansion  contains  constants  related  to  the  left  input  of  e: 


s 


and  the  second  expansion  contains  constants  related  to  the  right  input  of  e: 


eO  el  gO  g9  g2  e4  e8  e9 
9i  P  Qo  9i  9o  9i 

We  want  p(eO,  e9)  to  be  derivable  if  and  only  if  p(/0,  /9)  or  p(gO,  g9)  (or  both)  are  derivable. 
Til  will  put  the  following  atoms  into  the  EDB  to  make  this  come  about: 

•  9i(el,/0)  and  9i(/2,e4)  because  e  has  “or”  input  /; 

•  9i(«l»90)  an<l  91(92,  e4)  because  e  has  “or”  input  g; 

•  9b(e4,  e8)  because  e  is  an  or  gate; 

•  9i(e0,el)  and  9i(e8,e9)  because  e  has  inputs; 

•  9o(/9,/2),  9o(/7,/0),  go(ff9,g2),  and  go(if7,gO)  because  the  outputs  of  /  and  g  are 
used. 

Clearly,  Tn  can  run  in  log  apace  and  it  produces  an  EDB  that  will  enable  Pn  to  derive 
p(x0,x9)  if  gate  x  outputs  a  1.  In  the  next  lemma  we  show  that  the  “only  iP  direction 
also  holds. 

First  we  sununarize  the  EDB  produced  by  Tn  in  Fig.  8,  according  to  the  gate  appearing 
in  the  left  block  of  each  EDB  fact.  Let  x  stand  for  any  gate  in  the  circuit  C. 


EDB  contains: 

if  z  is: 

9o(xO,z9) 

a  1  gate 

9o(z4,z8) 

an  or  gate 

9o(x7,iO) 

any  gate 

qo{x9,x2) 

any  gate 

9i(z0,il) 

an  and  or  an  or  gate 

9i(xl,/0) 

an  and  gate  with  left  input  / 

9i(xl,/0) 

an  or  gate  with  either  input  / 

9i(z2,/i4) 

left  input  to  and  gate  h 

9i(z2,/i4) 

either  input  to  or  gate  h 

9i(z4,97) 

an  and  gate  with  right  input  g 

9i(z8,z9) 

an  and  or  an  or  gate 

?i(z9,  h8) 

right  input  to  and  gate  h 

Lemma  0.2:  Let  C  be  a  circuit  in  Cs  and  let  Tii(C)  be  the  associated  EDB.  Let  x  be  any 
gate  in  C.  We  define  /*  for  all  fc  >  0  to  be  the  “partial  model”  consisting  of  p  atoms  that 
are  derivable  by  Pn  extended  with  rii(C’)  with  at  most  k  applications  of  the  recursive  rule. 
Then  the  following  hold  for  all  k,  and  in  particular  for  the  minimum  model  of  PiiU7ii(C): 

(1)  Neither  p(z2,  Z)  nor  p(x8,  Z)  is  in  h  for  any  Z. 

(2)  If  p(x9,  Z)  is  in  /*,  then  Z  =  x2.  If  p(x7,  Z)  is  in  /*,  then  Z  =  zO. 

(S)  If  z  is  an  and  gate  with  left  input  /  and  p(zl,^)  is  in  7^,  then  Z  =  e4  and  p(/0, /9) 
is  also  in  Ik- 

(4)  If  z  is  an  and  gate  with  right  input  g,  and  p(z4,  Z)  is  in  Ik,  then  Z  =  z8  and  p(90,  j;9) 

is  also  in  Ik- 

(5)  If  z  is  an  or  gate  with  inputs  /  and  g  and  p(zl,  Z)  is  in  Ik,  then  Z  =  z4  and  p(/0,  /9) 

or  p(90,  ^9)  is  also  in  Ik- 

(6)  If  z  is  an  and  or  an  or  gate  and  p(z0,  Z)  is  in  Ik,  then  Z  =  z9  and  both  p(zl,z4] 

and  p(z4,z8)  are  in  Ik- 

Proof  :  We  use  induction  on  k,  the  number  of  recursive  rule  applications.  Note  that 
nonrecursive  rule  applications  are  not  counted.  Since  the  minimum  model  coincides  with 
Ik  for  some  finite  k,  this  will  establish  the  lemma.  The  basis,  Jk  =  0,  is  immediate. 

Assume  that  (l)-(6)  hold  for  Ik-i-  We  shall  cite  items  of  this  inductive  hypothesis  as 
(1H)-(6H).  We  say  that  a  p  atom  reduces  to  a  chain  of  subgoals  if  some  substitution  unifies 
the  head  of  the  rule  with  that  p  atom  and  unifies  the  subgoals  of  the  rule  to  that  subgoal 
chain.  Also,  we  say  a  reduction  fails  on  a  subgoal  if  no  instance  of  that  subgoal  is  in  Ik-i- 
Clearly  h-i  C  Ik- 

(1)  If  p(z2,  Z)  is  derivable  in  k  steps,  then  it  must  reduce  to 

q^{x2,hi),p[hi,V),p{V,W),qi { W,  Z) 

where  h  is  a  gate  and  p{h4,V)  is  in  Ik-i-  If  h  is  an  or  gate,  no  qi  has  /i4  in  its  left 
block,  so  9o(h4,/i8)  is  the  only  possible  reduction.  If  h  is  an  and  gate,  then  V  must 
be  hS  by  (4H).  Either  way,  V  =  /i8  and  the  reduction  fails  on  p{hS,W)  by  (IH). 
It  follows  that  p(z2,  Z]  is  not  in  Ik  for  any  Z.  Similarly,  using  (2H),  p(z8,Z)  must 
reduce  to 

qi (z8,  z9) , p(z9, z2) , p(z2,  W),qi {W,  Z) 
which  fails  on  p(z2,  W). 

(2)  The  only  recursive  reduction  of  p(z9,  Z)  is 

qi{x9,  hS) , p(h8,  V), p( V,  W) , 9i {W,  Z) 

which  fails  on  p{h8,V),  by  (iH).  There  is  no  recursive  reduction  at  all  for  p(z7,Z). 
Therefore  the  nonrecursive  reductions  to  go(z9,  z2)  and  9o(®7,zO)  must  be  used. 


(3)  Using  (6H)  and  (2H),  p{xl,Z)  must  reduce  to 

91  (xl,  /0),p(/0.  /9),p(/9,  /2),  gi(/2,  Z) 

where  /  is  the  left  input  of  and  gate  x.  But  by  the  requirements  of  class  Cj,  the  left 
output  of  gate  /  connects  only  to  x,  and  the  right  output  of  /  cannot  connect  to  any 
gate’s  left  input,  so  x  is  the  only  gate  with  left  input  /.  Therefore  the  only  instamce 
of  9i(/2,  2^)  in  the  EDB  is  Z  =  x4. 

(4)  Using  (2H)  and  (6H),  p(x4,Z)  must  reduce  to 

9i  (x4,  gl) ,  p(ff  7 ,  pO) ,  p(sO,  g9) ,  qi  (p9,  x8) 

where  g  is  the  right  input  of  and  gate  x.  Here  we  used  the  fact  that  x  is  the  only 
gate  with  right  input  g,  by  membership  in  Ct. 

(5)  Using  (6H)  and  (2H),  p(xl,Z)  must  reduce  to 

(xl,  /O) ,  p(/0,  /9) ,  p(/9,  /2) ,  9x  (/2,  x4) 
where  /  is  an  input  of  or  gate  x. 

(6)  Using  (3H)  and  (5H),  p(xO,  Z)  must  reduce  to 

9i(xO,x1),p(x1,x4),p(x4,W'),9i(H",Z) 

Then  if  x  is  an  and  gate,  by  (4H),  W  =  i8.  However,  if  x  is  an  or  gate,  p(x4,  Z) 
must  reduce  to  9o(x4,x8),  so  that  again  W  =  x8.  It  follows  that  Z  =  x9.  | 

Theorem  9.3:  Pu  is  P-complete. 

Proof :  Reduction  from  Monotone  Circuit  Value  is  accomplished  by  first  transforming  the 
given  circuit  into  am  equivalent  Cj  circuit,  then  using  Tn  on  that  to  construct  an  EDB  for 
Pii-  The  previous  lemma  shows  that  the  circuit  outputs  a  logical  1  at  gate  x  if  and  only 
if  p(xO,  x9)  is  a  theorem  of  the  resulting  extended  logic  program.  | 

We  next  consider  the  elementary  single  rule  program  that  results  if  we  insert  an  extra 
subgoal  into  P4,  at  the  left  end. 

P12  p{X,Y):-p{X,T)MT,U),qr{U,V),p{V,W),q2{W,Y). 

p{X,Y)  qo{X,Y). 


which  is  succinctly  expressed  by  the  grammar  Gu  consisting  of  P  — ►  PP91P92  \  90. 


I  ^ ^  ^  w^jh ^Jf^y  ^j^j>  v^-v  v^m  * j/ ". v» ** ■* j "j  * 


-tr. 


y-:^- 

>2^ 


Definition  0.1:  We  say  that  a  recursive  subgoal  in  an  elementary  chain  rule  is  completely 
cut  off  if  there  ue  nonrecursive  goals  between  it  and  both  ends  of  the  subgoal  chain.  We 
say  that  a  recursive  subgoal  in  an  elementary  chain  rule  is  parti&IIy  cut  off  if  it  not  at 
either  end  of  the  subgoal  chain  and  the  subgoal  chain  contains  at  least  one  nonrecursive 
subgoal.  □ 

Observe  that  the  subgoal  chain  in  has  two  occurrences  of  p  that  are  cut  off  from 
the  end  of  the  chain,  one  partially  and  one  completely.  The  basic  idea  for  the  simulation 
of  a  circuit  is  to  expand  the  leftmost  p  subgoal  and  assign  constants  as  suggested  below. 

eO  fO  /9  fl  f2  e4  g3  gO  g9  e9 
9o  P  9o  ft  ft  9i  p  ft 

Here  we  assume  e  is  an  and-gate  with  left  and  right  inputs  /  and  g. 


EDB  contains: 

if  X  is: 

9o(zO,x9) 

a  1  gate 

qolxO,  fO) 

an  and  gate  with  left  input  / 

qo(xO,  fO) 

an  or  gate  with  either  input  / 

ft(zl,x2) 

any  gate 

<to(x4,g3) 

an  and  gate  with  right  input  g 

9o(x4,x5) 

an  or  gate 

ft  (16,  x7) 

an  or  gate 

gi{x3,x0) 

any  gate 

9i(x5,x6) 

an  or  gate 

gi(x9,xl) 

any  gate 

gi(xO,xl) 

an  and  or  an  or  gate 

92(x2,/i4) 

left  input  to  and  gate  h 

q2(x2,h4) 

either  input  to  or  gate  h 

q2(x7,x9) 

an  or  gate 

gj(x9,/i9) 

right  input  to  and  gate  h 

Figure  9:  EDB  produced  by  T12,  where  x  is  any  gate. 


Theorem  9.4:  Pu  is  /’-complete. 


Proof :  Define  transformation  T12  to  transform  a  circuit  C  €  Cj  into  the  EDB  described 
by  Fig.  9.  The  inductive  hypothesis  is: 


•  p(eO,  Z)  is  derivable  =»  Z  =  /O  or  Z  =  e4  or  Z  =  e9.  If  Z  =  e4,  then  p(/0, /9) 
is  derivable,  where  /  is  the  left  input  of  and-gate  e.  If  Z  =  e9,  then  p(eO,  e4)  is 
derivable,  and  p(ffO,  p9)  is  derivable,  where  g  is  the  right  input  of  and-gate  e.  (There 
are  obvious  adjustments  when  e  is  an  or-gate.) 

•  p(zl,Z)  is  derivable  =>  Z  =  i2. 

•  p(e4,  Z)  is  derivable  =>  Z  —  p3,  where  g  is  the  right  input  of  and-gate  e.  (Adjust  for 
or-gate.) 

•  No  instances  of  p(z2,  Z),  p(z3,  Z),  or  p(z9,  Z)  are  derivable. 

The  argument  follows  the  same  lines  as  for  Pn,  so  further  details  are  omitted.  | 
Coi^ecture  Any  elementary  single  rule  program  in  which  the  recursive  rule  has  two  or 
more  cut  off  recursive  subgoals,  at  least  one  of  them  completely  cut  off,  b  P-complete. 

The  ideas  used  on  Pn  and  Pd  appear  to  be  generalizable.  However,  a  word  of  caution 
is  in  order.  If  a  certain  template,  such  as  Pn,  is  .P -complete,  it  is  tempting  to  conclude 
that  more  complicated  programs,  in  which  Pn  can  be  embedded,  are  also  P-complete. 
However,  this  is  not  automatically  true;  it  needs  to  be  proved. 

9.2  Other  P-Complete  Single  Rule  Programs 

In  this  section  we  briefly  summarize  a  pot  pourri  of  ^-completeness  results  for  vairious 
logicad  query  programs. 

Path  Systems  [Coo74,JL76]  is  well  known  to  be  ^-complete,  and  is  in  fact  the  “first” 
P-complete  problem.  It  is  very  naturally  represented  as  the  logical  query  program 

P,x  riX):-h(X,Y,Z),r{Y),riZ). 
r{X)  s{X). 

where  s  and  h  are  EDB  relations  specifying  “sources”  and  “hyperedges,”  and  r  signifies 
“reachable.” 

Example  9.1:  Cosmadakis  and  Kanellakis  [CK85|  discovered  that  an  interesting  vau-iant 
of  Path  Systems,  called  the  blue-blooded  Frenchman  problem,  is  also  /’-complete: 

P22  r(X)  :-hi(X,Y),r(Y),h2(X,Z),r{Z). 
r(X)  .(X-). 

Essentially  this  means  that  we  can  require  h  to  be  the  equi-join  of  hi  and  /12  in  Path 
Systems.  An  easy  proof  for  Pjj  is  by  reduction  from  Monotone  Circuit  Value.  Let  C  be  a 
given  circuit  in  Ci. 

•  For  each  and  gate  e  with  inputs  /  and  g  we  put  hi{e,f)  and  hi{e,g)  in  the  EDB  to 
signify  that  e  has  /  as  its  left  input  and  g  as  its  right  input. 


•  For  each  or  gate  e  with  inputs  /  and  g  we  put  hi{e,f)  and  hi{e,g)  in  the  EDB  to 
signify  that  e  has  /  and  g  as  its  inputs,  and  we  put  in  /i2(e,e')  and  s(e')  to  allow 
derivations  to  be  completed. 

•  For  each  1  gate  e  we  put  s(c)  in  the  EDB. 

An  easy  induction  on  proof  length  shows  that  if  r{X)  is  true,  then  either  X  =  e'  for  some 
or  gate  e  6  C,  or  X  =  c  for  some  gate  e  E  C  that  outputs  a  1. 

A  more  intriguing  approach  is  to  simulate  a  scajining  Turing  machine^  directly  with  P21. 
The  main  idea  is  use  hi  to  verify  the  tape  symbol  and  use  hi  to  verify  the  state  transition 
in  such  a  way  that  r(X)  is  true  exactly  when  X  is  a  reachable  local  ID,  consisting  of  an 
encoding  of  (time,  head-position,  state,  symbol-scanned).  The  details  are  omitted,  as  the 
ideas  can  be  found  in  [Coo74],  where  it  is  shown  that  any  language  in  P  can  be  accepted 
by  some  scanning  Turing  machine.  □ 

Let  us  define  pure  recursion  as  the  use  of  recursive  rules  in  which  every  subgoal  is 
recursive.  For  example.  Pi  and  P2  employ  pure  recursion.  The  next  example  demonstrates 
that  pure  recursion  can  be  .P -complete. 

Example  9.2:  The  following  program  employs  pure  recursion  and  has  a  single  recursive 
rule  in  which  the  subgoals  represent  a  chain,  but  not  an  elementary  chain.  That  is,  the 
left  block  and  right  block  are  defined  for  each  literal,  but  different  literals  with  the  same 
predicate  symbol  have  different  block  structures. 

P23  p(W,X,y,Z)  :-p([/,W,U,V),p(V,X,V,Z). 
p(lV,X,y,Z)  :-go(W,X,y,Z). 

For  the  rule  head  auid  the  rightmost  subgoal,  the  left  block  is  eu^gument  1  and  the  right 
block  is  arguments  2,  3,  and  4.  But  for  the  left  subgoal,  the  left  block  is  argument  2  and 
the  right  block  is  argument  4;  arguments  1  and  3  are  in  neither  block. 

Proof  of  P -completeness  is  by  reduction  from  Path  Systems.  Given  an  EDB  for  P21, 
we  encode  go  as  the  cross  product,  s  x  /».  It  then  follows  that  the  minimum  model  of  P2S 
will  be  the  cross  product  p  =  r  x  h,  where  r  is  the  minimum  model  of  P21.  □ 

Our  final  example  has  the  interesting  property  that  it  is  P-complete  for  unrestricted 
EDBs,  yet  is  in  MC  for  EDBs  in  which  the  go  relation  is  “acyclic,”  in  the  sense  defined  in 
the  example. 

Example  9.3:  Consider  the  following  program,  called  Transitive  Closure  with  Permis¬ 
sions.  It  exhibits  a  nonchain  rule: 

P24  p(X,K)  :-gi{X,y),p{X,U),p{U,y). 
p{X,y)  :-go{X,y). 

®  A  acanning  TM  scans  the  tape  in  the  fixed  sequence  0,  1,  0,  -1,  0,  1,  2,  1,  0,  -1,  -2,  ... 


The  proof  of  ^-completeness  for  unrestricted  EDBs  is  by  reduction  from  Monotone  Circuit 
Value  for  circuits  in  Ci.  To  sketch  the  main  idea,  we  want  and  gate  e  with  inputs  /  and 
g  to  output  a  1  if  and  only  if  p(eO,  e9)  is  derivable,  and  we  wauit  p(eO,  e9)  to  be  derivable 
only  if  both  p(e0,e4)  zmd  p(e4,e9)  are  derivable.  Thtis  we  put  9i(e0,e9)  in  the  EDB.  By 
judicious  choice  of  gi  and  EDB  facts  we  ensure  that 

•  p(eO,  e4)  is  derivable  if  and  only  if  p(/0,  /9)  is  derivable; 

•  p(e4,e9)  is  derivable  if  and  only  if  p(g0,g9)  is  derivable. 

Additional  details  may  be  worked  out  by  the  interested  reader. 

It  is  also  easy  to  see  that  P}4  is  in  A/ C  when  the  EDB  is  restricted  to  include  any  gi 
relation,  but  only  relations  that  define  the  edges  of  a  directed  acyclic  graph.  For  the  go 
atoms  in  the  fringe  of  any  complete  derivation  tree  form  a  chain,  and  constitute  a  constant 
fraction  of  all  atoms  in  the  fringe.  Thus  P24  has  the  Polynomial  Fringe  Property  when 
restricted  to  this  class  of  EDBs.  □ 

10  Conclusion  and  Open  Problems 

We  have  defined  the  Polynomial  Fringe  Property  for  logical  query  progr2uns  and  shown 
that  programs  with  this  property  are  in  MC.  For  programs  with  elementary  chain  rules, 
the  GSM  Mapping  Theorem  gives  a  sufficient  condition  that  they  have  the  Polynomial 
Fringe  Property,  in  terms  of  GSM  mappings  of  Pi$,  the  Dyck  language  on  one  kind  of 
parentheses,  with  an  end-marker  appended. 

Our  applications  of  the  GSM  Mapping  Theorem  (see  P4,  Ps,  Pr)  were  rather  ad  hoe: 
We  first  characterized  the  language,  then  found  a  GSM  to  generate  it,  and  a  deterministic 
one  at  that.  Is  there  some  interesting  class  of  grammars  for  which  we  C2m  go  directly  to 
GSM’s?  Can  we  use  non-determinism? 

In  Section  9,  we  conjectured  that  two  cut  off  recursive  subgoals  in  an  elementary  single 
rule  program  ensures  P-completeness.  Is  this  conjecture  true? 

Bottom-up  evaluation  of  the  entire  minimum  model  of  a  logical  query  program  is  not 
usually  considered  practical.  The  application  of  the  techniques  in  this  paper  to  a  mixed 
bottom-up  and  top-down  strategy  that  allows  a  high  degree  of  parallelism,  yet  only  works 
on  “somewhat  relevant”  portions  of  the  minimum  model,  represents  a  significant  open 
problem. 
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